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Preface 


Today, a firm understanding of mathematics is essential for any serious student of economics. 
Students of economics need nowadays several important mathematical tools. These include 
calculus for functions of one or several variables as well as a basic understanding of 
optimization with and without constraints, e.g. linear programming plays an important 
role in optimizing production programs. Linear algebra is used in economic theory and 
econometrics. Students in other areas of economics can benefit for instance from some 
knowledge about differential and difference equations or mathematical problems arising in 
finance. The more complex economics becomes, the more deep mathematics is required and 
used. Today economists consider mathematics as the most important tool of economics and 
business. This book is not a book on mathematical economics, but a book on higher-level 
mathematics for economists. 

Experience shows that students who enter a university and specialize in economics vary 
enormously in the range of their mathematical skills and aptitudes. Since mathematics is 
often a requirement for specialist studies in economics, we felt a need to provide as much 
elementary material as possible in order to give those students with weaker mathematical 
backgrounds the chance to get started. Using this book may depend on the skills of readers and 
their purposes. The book starts with very basic mathematical principles. Therefore, we have 
included some material that covers several topics of mathematics in school (e.g. fractions, 
powers, roots and logarithms in Chapter 1 or functions of a real variable in Chapter 3). So 
the reader can judge whether or not he (she) is sufficiently familiar with mathematics to be 
able to skip some of the sections or chapters. 

Studying mathematics is very difficult for most students of economics and business. 
However, nowadays it is indeed necessary to know a lot of results of higher mathematics 
to understand the current economic literature and to use modern economic tools in practical 
economics and business. With this in mind, we wrote the book as simply as possible. On the 
other hand, we have presented the mathematical results strongly correct and complete, as is 
necessary in mathematics. The material is appropriately ordered according to mathematical 
requirements (while courses, e.g. in macroeconomics, often start with advanced topics such 
as constrained optimization for functions of several variables). On the one hand, previous 
results are used by later results in the text. On the other hand, current results in a chapter 
make it clear why previous results were included in the book. 

The book is written for non-mathematicians (or rather, for those people who only want 
to use mathematical tools in their practice). It intends to support students in learning the 
basic mathematical methods that have become indispensable for a proper understanding 
of the current economic literature. Therefore, the book contains a lot of worked examples 
and economic applications. It also contains many illustrations and figures to simplify the 
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mathematical techniques used and show how mathematical results may be used in economics 
and business. Some of these examples have been taken from former examinations (at the 
Otto-von-Guericke University of Magdeburg), and many of the exercises given at the end of 
each chapter have been used in the tutorials for a long time. In this book, we do not show how 
the mathematical results have been obtained and proved, but we show how they may be used 
in real-life economics and business. Therefore, proofs of theorems have been skipped (with 
a few exceptions) so that the volume of the book does not substantially exceed 500 pages, 
but in spite of the relatively short length the book includes the main mathematical subjects 
useful for practical economics and an efficient business. 

The book should serve not only as a textbook for a course on mathematical methods for 
students, but also as a reference book for business people who need to use higher-level 
mathematics to increase profits. (Of course, one will not increase profit by solving e.g. 
a differential equation, but one can understand why somebody has increased profits after 
modelling a real process and finding a solution for it.) One of the purposes of this book is to 
introduce the reader to the most important mathematical methods used in current economic 
literature. We also provide an introduction to the close relationship between mathematical 
methods and problems arising in the economy. However, we have included only such 
economic applications as do not require an advanced knowledge of economic disciplines, 
since mathematics is usually taught in the first year of studies at university. 

The reader needs only knowledge of elementary mathematics from secondary school to 
understand and use the results of the book, i.e. the content is self-sufficient for understanding. 
For a deeper understanding of higher mathematics used in economics, we also suggest a small 
selection of German and English textbooks and lecture materials listed in the literature section 
at the end of the book. Some of these books have been written at a comparable mathematical 
level (e.g. Opitz, Mathematik; Simon and Blume, Mathematics for Economists; Sydsaeter 
and Hammond, Mathematics for Economic Analysis) while others are more elementary in 
style (e.g. Misrahi and Sullivan, Mathematics and Finite Mathematics; Ohse, Mathematik 
fiir Wirtschaftswissenschaftler; Rosser, Basic Mathematics for Economists). The booklets 
(Schulz, Mathematik für wirtschaftswissenchaftliche Studiengänge; Werner, Mathematics 
for Students of Economics and Management) contain most important definitions, theorems 
of a one-year lecture course in mathematics for economists in a compact form and 
they sketch some basic algorithms taught in the mathematics classes for economists at 
the Otto-von-Guericke University of Magdeburg during recent decades. Bronstein and 
Semandjajew, Taschenbuch der Mathematik, and Eichholz and Vilkner, Taschenbuch der 
Wirtschaftsmathematik, are well-known handbooks of mathematics for students. Varian, 
Intermediate Microeconomics, is a standard textbook of intermediate microeconomics, where 
various economic applications of mathematics can be found. 

The book is based on a two-semester course with four hours of lectures per week which 
the first author has given at the Otto-von-Guericke University of Magdeburg within the last 
ten years. The authors are indebted to many people in the writing of the book. First of 
all, the authors would like to thank Dr Iris Paasche, who was responsible for the tutorials 
from the beginning of this course in Magdeburg. She contributed many suggestions for 
including exercises and for improvements of the contents and, last but not least, she prepared 
the answers to the exercises. Moreover, the authors are grateful to Dr Günther Schulz for 
his support and valuable suggestions which were based on his wealth of experience in 
teaching students of economics and management at the Otto-von-Guericke University of 
Magdeburg for more than twenty years. The authors are grateful to both colleagues for their 
contributions. 
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The authors also thank Ms Natalja Leshchenko of the United Institute of Informatics 
Problems of the National Academy of Sciences of Belarus for reading the whole manuscript 
(and carefully checking the examples) and Mr Georgij Andreev of the same institute for 
preparing a substantial number of the figures. Moreover, many students of the International 
Study Programme of Economics and Management at the Otto-von-Guericke University of 
Magdeburg have read single chapters and contributed useful suggestions, particularly the 
students from the course starting in October 2002. In addition, the authors would like to 
express their gratitude to the former Ph.D. students Dr Nadezhda Sotskova and Dr Volker 
Lauff, who carefully prepared in the early stages a part of the book formerly used as printed 
manuscript in IATEX and who made a lot of constructive suggestions for improvements. 

Although both authors have taught in English at universities for many years and during 
that time have published more than 100 research papers in English, we are nevertheless 
not native speakers. So we apologize for all the linguistic weaknesses (and hope there are 
not too many). Of course, for all remaining mathematical and stylistic mistakes we take 
full responsibility, and we will be grateful for any further comments and suggestions for 
improvements by readers for inclusion in future editions (e-mail address for correspondence: 
frank.werner@mathematik.uni-magdeburg.de). Furthermore, we are grateful to Routledge 
for their pleasant cooperation during the preparation ofthe book. The authors wish all readers 
success in studying mathematics. 

We dedicate the book to our parents Hannelore Werner, Willi Werner, Maja N. Sotskova 
and Nazar F. Sotskov. 


F.W. 
Y.N.S. 
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1 Introduction 


In this chapter, an overview on some basic topics in mathematics is given. We summarize 
elements of logic and basic properties of sets and operations with sets. Some comments on 
basic combinatorial problems are given. We also include the main important facts concerning 
number systems and summarize rules for operations with numbers. 


11 LOGIC AND PROPOSITIONAL CALCULUS 


This section deals with basic elements of mathematical logic. In addition to propositions and 
logical operations, we discuss types of mathematical proofs. 


1.1.1 Propositions and their composition 
Let us consider the following four statements А, B, C and D. 


A Number 126 is divisible by number 3. 

B Equality 5 · 11 = 65 holds. 

C Number 11 is a prime number. 

D OnlJuly 1000 it was raining in Magdeburg. 


Obviously, the statements A and C are true. Statement B is false since 5- 11 = 55. Statement 
D is either true or false but today probably nobody knows. For each of the above statements 
we have only two possibilities concerning their truth values (to be true or to be false). This 
leads to the notion of a proposition introduced in the following definition. 


Definition 1.1 A statement which is either true or false is called a proposition. 


Remark For a proposition, there are no other truth values than ‘true’ (T) or ‘false’ (F) 
allowed. Furthermore, a proposition cannot have both truth values ‘true’ and ‘false’ (principle 
of excluded contradiction). 


Next, we consider logical operations. We introduce the negation of a proposition and con- 
nect different propositions. Furthermore, the truth value of such compound propositions is 
investigated. 


2 Introduction 


Definition 1.2 A proposition A (read: not A) ) is called the negation of proposition A. 
Proposition A is true if A is false. Proposition A is false if A is true. 


One can illustrate Definition 1.2 by a so-called truth table. According to Definition 1.2, the 
truth table of the negation of proposition A is as follows: 


A|T|F 
А|Е\Т 


Considering the negations of the propositions A, B and C, we obtain: 


A Number 126 is not divisible by the number 3. 
B Equality 5 · 11 = 65 does not hold. 
C The number 11 is not a prime number. 


Propositions А and C are false and B is true. 


Definition 1.3 The proposition A ^ B (read: A and В) is called a conjunction. 
Proposition A ^ B is true only if propositions A and B are both true. Otherwise, A ^ B 
is false. 


According to Definition 1.3, the truth table of the conjunction A ^ B is as follows: 


by 
[S N 


ТЕЕ 
ЕТЕ 
ЕЕЕ 


Definition 1.4 The proposition AV B (read: A or B) is called a disjunction. Proposition 
A V B is false only if propositions A and В are both false. Otherwise, A v B is true. 


According to Definition 1.4, the truth table of the disjunction A v B is as follows: 


A T\|T|F\F 
B T FIT|F 
AVB|T|T\|T\|F 


The symbol v stands for the ‘inclusive or’ which allows that both propositions are true (in 
contrast to the ‘exclusive or’, where the latter is not possible). 
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Example 1.1 Consider the propositions M and P. 


M In 2003 Magdeburg was the largest city in Germany. 
P  In2003 Paris was the capital of France. 


Although proposition P is true, the conjunction M ^ P is false, since Magdeburg was not 
the largest city in Germany in 2003 (i.e. proposition M is false). However, the disjunction 
M v P is true, since (at least) one of the two propositions is true (namely proposition P). 


Definition 1.5 The proposition А ==> B (read: if A then B) is called an implication. 
Only if A is true and B is false, is the proposition A => B defined to be false. In all 
remaining cases, the proposition A => В is true. 


According to Definition 1.5, the truth table of the implication A => В is as follows: 


A TIT|F|F 
B TIF|T|F 
A—BITIFI|TI|T 


For the implication А => В, proposition A is called the hypothesis and proposition B is 
called the conclusion. An implication is also known as a conditional statement. Next, we 
give an illustration of the implication. A student says: If the price of the book is at most 
20 EUR, I will buy it. This is an implication A => B with 


A The price ofthe book is at most 20 EUR. 
B The student will buy the book. 


In the first case of the four possibilities in the above truth table (second column), the student 
confirms the validity of the implication А => B (due to the low price of no more than 
20 EUR, the student will buy the book). In the second case (third column), the implication is 
false since the price of the book is low but the student will not buy the book. The truth value 
of an implication is also true if A is false but B is true (fourth column). In our example, this 
means that it is possible that the student will also buy the book in the case of an unexpectedly 
high price of more than 20 EUR. (This does not contradict the fact that the student certainly 
will buy the book for a price lower than or equal to 20 EUR.) In the fourth case (fifth column 
of the truth table), the high price is the reason that the student will not buy the book. So in 
all four cases, the definition of the truth value corresponds with our intuition. 


Example 1.2 Consider the propositions A and В defined as follows: 


A The natural number л is divisible by 6. 
B The natural number л is divisible by 3. 


We investigate the implication A => B. Since each of the propositions A and B can be true 
and false, we have to consider four possibilities. 
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If n is a multiple of 6 (i.e. n € {6,12,18,...}), then both A and B are true. According 
to Definition 1.5, the implication A == В is true. If n is a multiple of 3 but not of 6 
(i.e. n € {3,9,15,...}), then A is false but В is true. Therefore, implication A => В is true. 
If n is not a multiple of 3 (i.e. n € {1,2,4,5, 7,8, 10,...}), then both A and B are false, and 
by Definition 1.5, implication А => В is true. It is worth noting that the case where А is true 
but B is false cannot occur, since no natural number which is divisible by 6 is not divisible 
by 3. 


Remark For an implication A => В, one can also say: 


(1) A implies B; 

(2) from A it follows B; 

(3) A is sufficient for B; 

(4) B is necessary for A; 

(5) A is true only if B is true; 
(6) if A is true, then В is true. 


The latter four formulations are used in connection with the presentation of mathematical 
theorems and their proof. 


Example 1.3 Consider the propositions 


H Claudia is happy today. 
E Claudia does not have an examination today. 


Then the implication Н == E means: If Claudia is happy today, she does not have an 
examination today. Therefore, a necessary condition for Claudia to be happy today is that 
she does not have an examination today. 


In the case of the opposite implication E == Н, a sufficient condition for Claudia to be 
happy today is that she does not have an examination today. 


If both implications Н => Е and E => Н are true, it means that Claudia is happy today if 
and only if she does not have an examination today. 


Definition 1.6 The proposition A <> В (read: A is equivalent to B) is called 
equivalence. Proposition A «== В is true if both propositions A and B are true or 
propositions A and В are both false. Otherwise, proposition A <=> В is false. 


According to Definition 1.6, the truth table of the equivalence A <> B is as follows: 
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Remark For an equivalence A <> В, one can also say 


(1) A holds if and only if B holds; 
(2) Ais necessary and sufficient for B. 


For a compound proposition consisting of more than two propositions, there is a hierarchy 
of the logical operations as follows. The negation of a proposition has the highest priority, 
then both the conjunction and the disjunction have the second highest priority and finally the 
implication and the equivalence have the lowest priority. Thus, the proposition 

A^B = С 
may also be written as 


[4 AB] e С. 


By means of a truth table we can investigate the truth value of arbitrary compound 
propositions. 


Example 1.4 We investigate the truth value of the compound proposition 


Av B = (B — А). 


One has to consider four possible combinations of the truth values of 4 and B (each of the 
propositions A and B can be either true or false): 


A TIT|FI|F 
B Т|Е|Т|Е 
AVB TI|T|T|F 
АУ B FIFIF|T 
A F|F|T|T 
B F\T\|F|T 
В =» А T|F T|T 
AvB = (B—A)|T|T|T|T 


In Example 1.4, the implication is always true, independently of the truth values of the 
individual propositions. This leads to the following definition. 


Definition 1.7 A compound proposition which is true independently of the truth 
values of the individual propositions is called a tautology. A compound proposition 
being always false is called a contradiction. 
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Example 1.5 We investigate whether the implication 


(A => B) = (A = В) (1.1) 


is a tautology. As in the previous example, we have to consider four combinations of truth 
values of propositions A and B. This yields the following truth table: 


A— B 
(А = В) — (4 = B) 


MO M utu 
Sey ey 


Э om oU Ug 


EN 
Sexe TTR 


Independently of the truth values of A and B, the truth value of the implication considered is 
true. Therefore, implication (1.1) is a tautology. 


Some further tautologies are presented in the following theorem. 


THEOREM 1.1 The following propositions are tautologies: 


(1) 4^AB&-B^A, AVB&>BVA 
(commutative laws of conjunction and disjunction); 

(2) AAB)AC < Ал (Вл С), (4 В) У С < АУ (ВУС) 
(associative laws of conjunction and disjunction); 

(3) AAB) МС < (АУС) л (ВМС), (AVB) AC <> (Ал С) У (Вл С) 
(distributive laws). 


Remark The negation of a disjunction of two propositions is a conjunction and, analo- 
gously, the negation of a conjunction of two propositions is a disjunction. We get 


АУ В <= Ал В апа Ал В <4 АУ В (de Morgan 5 laws). 


PROOF De Morgan’s laws can be proved by using truth tables. Let us prove the first equiv- 
alence. Since each of the propositions A and B has two possible truth values T and F, we have 
to consider four combinations of the truth values of A and B: 


ES - a 
БУ 


a 


EN 
by 

My NNN 

MNT BMAD 

SYN BAS 8 

Sys 


> ыы < 


al 
l 
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If we compare the fourth and last rows, we get identical truth values for each of the four 
possibilities and therefore the first law of de Morgan has been proved. B 


The next theorem presents some tautologies which are useful for different types of 
mathematical proofs. 


THEOREM 1.2 The following propositions are tautologies: 


() 4 B) 4 х Га > B) ^ B — A)|; 
(2) 4— В) л (В => C) == (4— С); 
(3) (4 = B) < (B= А) —— A ^B. 


PROOF We prove only part (1) and have to consider four possible combinations of the truth 
values of propositions A and В. This yields the following truth table: 


A 
B 


А => В 
В => А 

(A — B) ^ (В => A) 
А <= B 


BRAY 
-o uuu +з 
m omui "ч 
BARS 


The latter two rows give identical truth values for all four possibilities and thus it has been 
proved that the equivalence of part (1) is a tautology. a 


Part (1) of Theorem 1.2 can be used to prove the logical equivalence of two propositions, 
i.e. we can prove both implications A => В and В => A separately. Part (2) of Theorem 1.2 
is known as the transitivity property. The equivalences of part (3) of Theorem 1.2 are used 
later to present different types of mathematical proof. 


1.1.2 Universal and existential propositions 


In this section, we consider propositions that depend on the value(s) of one or several 
variables. 


Definition 1.8 A proposition depending on one or more variables is called an open 
proposition or a propositional function. 


We denote by A(x) a proposition A that depends on variable x. Let A(x) be the following open 
proposition: x? + x — 6 = 0. For x = 2, the proposition A(2) is true since 22 + 2 — 6 = 0. 
On the other hand, for x = 0, the proposition A(0) is false since 0? + 0 — 6 £0. 
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Next, we introduce the notions of a universal and an existential proposition. 


Definition 1.9 А conjunction 
A40 
x 


of propositions A(x), where x can take given values, is called a universal proposition. 
The universal proposition is true if the propositions A(x) are true for all x. If at least 
one of these propositions is false, the universal proposition is false. 


Proposition 
А0) 
x 


may be written in an equivalent form: 


Vx: A(x). 


Definition 1.10 A disjunction 
V 40) 
x 


of propositions A(x) is called an existential proposition. If at least one x exists such 
that A(x) is true, the existential proposition is true. If such an x does not exist, the 
existential proposition is false. 


Proposition 
V 40) 
x 
may be written in an equivalent form: 
dx: A(x). 


If variable x can take infinitely many values, the universal and existential propositions are 
compounded by infinitely many propositions. 
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Remark The negation of an existential proposition is always a universal proposition and 
vice versa. In particular, we get as a generalization of de Morgan’s laws that 


VA4c) — N sad ДАО) < VIO 


are tautologies. 


Example 1.6 Let x be a real number and 
A(x): х2 +3х-4 = 0. 


Then the existential proposition 
V AG) 
x 


is true since there exists a real x for which x? + 3x — 4 = 0, namely for the numbers x = 1 
and x = —4 the equality x? + 3x — 4 = 0 is satisfied, i.e. propositions A(1) and A(—4) 
are true. 


Example 1.7 Let x be a real number and consider 
Ах: a SO: 


Then the universal proposition 
A 4€) 
x 


is true since the square of any real number is non-negative. 


1.1.3 Types of mathematical proof 


A mathematical theorem can be formulated as an implication А => В (or as several impli- 
cations) where A represents a proposition or a set of propositions called the hypothesis or 
the premises (‘what we know’) and B represents a proposition or a set of propositions that 
are called the conclusions (‘what we want to know’). One can prove such an implication in 
different ways. 
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Direct proof 


We show a series of implications C; => Cj; fori = 0,1,...,n — 1 with Со = A and 
Cn = B. By the repeated use of the transitivity property given in part (2) of Theorem 1.2, this 
means that A => В is true. As a consequence, if the premises A and the implication А ==> В 
are true, then conclusion B must be true, too. Often several cases have to be considered in 
order to prove an implication. For instance, if a proposition A is equivalent to a disjunction 
of propositions, i.e. 


А = Ај А У... УА, 
опе can prove ће implication A => В by showing the implications 


Ар => В, Аз = В, ..., А, = В. 


Indirect proof 


According to part (3) of Theorem 1.2, instead of proving an implication А => В directly, 
one can prove an implication in two other variants indirectly. 


(1) Proof of contrapositive. For A => B, it is equivalent (see part (3) of Theorem 1.2) 
to show В => A. As a consequence, if the conclusion В does not hold (В is true) and 
В => А, then the premises A cannot hold (А is true). We also say that В => A isa 
contrapositive of implication A => В. 

Proof by contradiction. This is another variant of an indirect proof. We know (see 
part (3) of Theorem 1.2) that 


Q 


— 


(А — B) <> А ^ В. 


Thus, we investigate А ^ B which must lead to а contradiction, i.e. in ће latter case we 


have shown that proposition А ^ B is true. 


Next, we illustrate the three variants of a proof mentioned above by a few examples. 


Example 1.8 Given are the following propositions A and B: 
A: xl. 
3 1 
B: xX +4x+ — -3 42. 
x—1 


We prove implication А => B by a direct proof. 
To this end, we consider two cases, namely propositions 4; and 42: 


Aj: x « Il. 


Az: x » 1. 


It is clear that A <> A, V А2. Let us prove both implications 4; => В and 42 => B. 
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A, => B. Since inequalities x? < 1,4x < 4 and 1/(x — 1) < 0 hold for x < 1, we obtain 
(using the rules for adding inequalities, see also rule (5) for inequalities in Chapter 1.4.1) the 
implication 


1 
x«l] => AK pde <5. 
xu 
Moreover, 


1 1 
x R4x4 — «5 => 44x4+—-3 £2, 
x—1 x—1 


i.e. 4] = > B. 
Аз = B. Since inequalities x? > 1,4x > 4 and 1/(x — 1) > 0 hold for x > 1, we obtain 
(using the rules for adding inequalities) the implication 


1 
х> 1 = XA —— > 5. 
X 
Moreover, 
3 1 3 1 
X +4х+——>5 = x +4x4+ — —-342, 
x—1 x—1 


i.e. 4) => B. 
Due to А < A, V 45, we have proved the implication А => B. 


Example 1.9 We prove the implication А => B, where 
A: x is a positive real number. 


16 
B:x+—>8. 
x 
Applying the proof by contradiction, we assume that proposition 4 ^ B is true, i.e. 
> : 2x 16 
А л В: xisa positive real number and x + — < 8. 
x 
Then we get the following series of implications: 


16 
(z+ É <8) n æ> 0 2416 <8 
x 


x? + 16 < 8x — х2 — 8x +16 < 0 
x? — 8x + 16 <0 — (x — A < 0. 
Now we have obtained the contradiction (x — 4)? — 0 since the square of any real number 


is non-negative. Hence, proposition А ^ B is false and therefore the implication А => B is 
true. Notice that the first of the three implications above holds due to the assumption x > 0. 
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Example 1.10 We prove A < B, where 


A : x is an even natural number. 

В : x? is an even natural number. 
According to Theorem 1.2, part (1), А <> B is equivalent to proving both implications 
A = В and В => A. 


(а) A => B. Let x be an even natural number, i.e. x сап be written as х = 2n with n being 
some natural number. Then 


х2 = (2n)* = 4n? = 202m, 
i.e. x? has the form 2k and is therefore an even natural number too. 
(b) В => A. We use the indirect proof by showing A => B. Assume that x is an odd natural 
number, i.e. x = 2n + 1 with n being some natural number. Then 


x? = (Qn + 1 = 4n? + 4п +1 = 20? + 2n) +1. 


Therefore, x? has the form 2k + 1 and is therefore an odd natural number too, and we have 
proved implication А => B which is equivalent to implication В = A. 


Example 1.11 Consider the propositions 
A: -x +3x> 0 and B: x>0. 


We prove the implication A => B by all three types of mathematical proofs discussed 
before. 

(1) To give a direct proof, we suppose that —x* + 3x > 0. The latter equation can be rewritten 
as Зх > х2. Since x? > 0, we obtain 3x > x? > 0. From 3x > 0, we get x > 0, i.e. we have 
proved A => B. 

(2) To prove the contrapositive, we have to show that В => A. We therefore suppose that 
x « 0. Then 3x < 0 and —x? + 3x < 0 since the term —x? is always non-positive. 

(3) For a proof by contradiction, we suppose that A A B is true, which corresponds to the 
following proposition. There exists an x such that 


—x?+3x>0 and x < 0. 
However, if x < 0, we obtain 
—x? + 3x < —х? <0. 


The inequality —x* + 3x < 0 is a contradiction of the above assumption. Therefore, we have 
proved that proposition 4 ^ B is true. 
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We finish this section with a type of proof that can be used for universal propositions of a 
special type. 


Proof by induction 


This type of proof can be used for propositions 
A AQ. 
n 


where n = Ё, k + 1,..., and К is a natural number. The proof consists of two steps, namely 
the initial and the inductive steps: 


(1) Initial step. We prove that the proposition is true for an initial natural number n = k, 
i.e. A(K) is true. 

(2) Inductive step. To show a sequence of (infinitely many) implications A(k) => 
A(k 4- 1), A(k -- 1) = > А(Е+ 2), and so on, it is sufficient to prove once the implication 


A(n) => A(n t 1) 


for an arbitrary n € (k, k + 1,...}. The hypothesis that A(n) is true for some n is also 
denoted as inductive hypothesis, and we prove in the inductive step that then also A(n+ 1) 
must be true. In this way, we can conclude that, since A(x) is true by the initial step, 
A(k + 1) must be true by the inductive step. Now, since A(k + 1) is true, it follows again 
by the inductive step that A(k 4- 2) must be true and so on. 


The proof by induction is illustrated by the following two examples. 


Example 1.12 We want to prove by induction that 


n 
n 


1 
UL Seer, TUNE 


i=1 


is true for any natural number n. 
In the initial step, we consider A(1) and obtain 


1 


1 1 
ep GED E 


i=1 


which is obviously true. In the inductive step, we have to prove that, if A(n) is true for some 
natural number п = k, then also 


ntl 
1 п-+ 1 
А 1 i = 
ER. УЗ P n+2 


i=1 
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has to be true for the next natural number n + 1. We consider the inductive hypothesis A(n) 
and add 1/[(n + 1)(n + 2)] on both sides. This yields 


n 


1 1 n 1 
xD (nt+l(n+2) n+l Е (n+ 1) (и + 2) 


QC 1 nn +2) 41 


cdi (n+ +2) 


| и + 2п+1 


—(!@+)0 @+D@+2) 


ent. fl (n + 1)2 


0109-1)  GDGr2 


Exh и nil 


c id 1) cu 


Thus, we have shown that A(1) holds and that implication A(n) => A(n + 1) holds for an 
arbitrary natural number. Therefore, the proposition A(n) is true for all natural numbers n. 


A proof by induction can also be used when proving the validity of certain inequalities. 


Example 1.13 We prove by induction that 
A(n): 2”>п 
is true. In the initial step, we obviously get for п = 1 
AQ) : 2!=2>1, 
i.e. A(1) is true. In the inductive step, we have to prove implication A(n) => A(n + 1) with 
A(n+1): 2"%!\ »эп+1, 
i.e. we show that 
2% >п => 2 xnl 


Multiplying both sides of the inductive hypothesis А(л) (which is assumed to be true) by 2 
yields 


2.2" > 2n. 
From n > 1, we get 


2n>n+1. 
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Combining both inequalities gives 
2711 > n+], 


and thus we can conclude that A(n) is true for all natural numbers n. 


1.2 SETS AND OPERATIONS ON SETS 


In this section, we introduce the basic notion of a set and discuss operations on sets. 


1.2.1 Basic definitions 


A set is a fundamental notion of mathematics, and so there is no definition of a set by other 
basic mathematical notions. A set may be considered as a collection of distinct objects which 
are called the elements of the set. For each object, it can be uniquely decided whether it is 
an element of the set or not. We write: 


a € A: ais an element of set A; 
b ё A: b is not an element of set A. 


A set can be given by 


(1) enumeration, i.e. A = (a1,a2,...,a4] which means that set A consists of the elements 
а1,а2,...,аһ, OF 

(2) description, i.e. A = (a | property P} which means that set A contains all elements with 
property P. 


Example 1.14 Let A = (3,5,7,9,11]. Here set A is given by enumeration and it contains 
as elements the five numbers 3, 5, 7, 9 and 11. Set A can also be given by description as 
follows: 


A-—íx|3zxxx110) A (isan odd integer). 


Definition 1.11 А set with a finite number of elements is called a finite set. The 
number of elements of a finite set A is called the cardinality of a set A and is denoted 
by |41. 

A set with an infinite number of elements is called an infinite set. 


Finite sets are always denumerable (or countable), i.e. their elements can be counted one by 
one in the sequence 1,2,3,.... Infinite sets are either denumerable (e.g. the set of all even 
positive integers or the set of all rational numbers) or not denumerable (e.g. the set of all real 
numbers). 
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Definition 1.12 А set В is called a subset of a set A (in symbols: В С A) if every 
element of B is also an element of A. 

Two sets A and В are said to be equal (in symbols: A = B) if both inclusions A С В 
and B C A hold. 

A set A is called an empty set (in symbols: A = @) if A contains no element. 


If B is a subset of A, one can alternatively say that set B is contained in set A or that set 
A includes set B. In order to prove that two sets A and B are equal, we either prove both 
inclusions A C B and B C A, or alternatively we can prove that some element x is contained 
in set A if and only if it is contained in set В. The latter can be done by a series of logical 
equivalences. 


Example 1.15 Let A = {1,3,5}. We calculate the number of subsets of set A. 

We get the three one-element sets 4; = {1}, 42 = {3}, 43 = {5}, the three two-element sets 
A4 = {1,3}, А5 = {1,5}, А6 = (3,5) and the two limiting cases Й апа A. Thus we have 
found eight subsets of set A. 


Definition 1.13 The set of all subsets of a set A is called the power set of set A and 
is denoted by P(A). The limiting cases Ø and A itself belong to set P(A). 


The number of elements of the power set of a finite set is given by the following theorem. 
THEOREM 1.3 The cardinality of set P(A) is given by |P(A)| = 2l. 


For the set A given in Example 1.15, we have |A| = 3. According to Theorem 1.3, |P(A)| = 
2? — 8 is obtained what we have already found by a direct enumeration. 


1.2.2 Operations on sets 


In this section, we discuss some operations on sets. 


Definition 1.14 The set of all elements which belong either only to a set 4 or only to 
a set B or to both sets A and B is called the union of the two sets A and B (in symbols 
A U B, read: A union B): 


AUB-Íx|xeAvxe B). 


Set A U B contains all elements that belong at least to one of the sets A and B. 
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Definition 1.15 The set of all elements belonging to both sets A and B is called the 
intersection of the two sets A and B (in symbols А N B, read: A intersection B): 


ANB={x|xEeAAx eB}. 


Two sets A and В are called disjoint if A ПВ = 0. 


Definition 1.16 The set of all elements belonging to a set A but not to a set В is called 
the difference set of A and B (in symbols A \ B, read: A minus B): 


А\В = {х |хєАлх ¢ B). 


If B С A, then the set А \ В is called the complement of B with respect to A (in 
symbols B 4). 


Definitions 1.14, 1.15 and 1.16 are illustrated by so-called Venn diagrams in Figure 1.1, 
where sets are represented by areas in the plane. The union, intersection and difference ofthe 
two sets as well as the complement of a set are given by the dashed areas. For the difference 
set A \ B, we have the following property: 


AUB 


Figure 1.1 Venn diagrams for the union, intersection and difference of sets A and B. 
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THEOREM 1.4 Let А and B be arbitrary sets. Then: 


A\B=A\ (4N B) = (AU B)AB. 


In Theorem 1.4 sets A and B do not need to be finite sets. Theorem 1.4 can be illustrated by 
the two Venn diagrams given in Figure 1.2. 


ANB 
B B 


A\ (ANB) (AUB)\B 


Figure 1.2 Illustration of Theorem 1.4. 


Example 1.16 Let 
A = {3,5,7} and B= {2,3,4,7, 8}. 
Then 


AUB = {2,3,4,5,7,8}, ANB = (3,7), 
ANB — {5}, В\А = {2,4,8}. 


Example 1.17 Let А, B, C be arbitrary sets. We prove that 
(AUB)\C=(A\C)U(B\C). 


As mentioned before, equality of two sets can be shown by proving that an element x belongs 
to the first set if and only if it belongs to the second set. By a series of logical equivalences, 
we obtain 
x E (АОВ) \ С <=» хє АОВлхе С 

4> (хє Аухє В) лхеС 

4 (хєе4лх я С) у (хєВлх EC) 

= хє А\СмхєВв\с 

«= x € (4\ С) О (BXC). 
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Thus, we have proved that x є (A UB) \ C if and only if x € (А\ С) U (B \ C). In the above 
way, we have simultaneously shown that 


(4U B)NC € (А\С)О(В\С) 
and 
(4NC)U (BV C) € (AU B)AC. 


The first inclusion is obtained when starting from the left-hand side and considering the 
implications ‘==’, whereas the other inclusion is obtained when starting from the right-hand 
side of the last row and considering the implications ‘<—’. 


Next, we present some rules for the set operations of intersection and union. 


THEOREM 1.5 Let A, B, C, D be arbitrary sets. Then: 


(1) ANB=BNA, AUB=BUA 
(commutative laws of intersection and union); 

(2) (ANB)NC=AN(BNOQ), (AUB)UC=AU(BUC) 
(associative laws of intersection and union); 

(3 (АПВ) ЧС = (4UC)n (ВОС), (АОВ) АС = (АПС) О (ВПС) 
(distributive laws of intersection and union). 


As a consequence, we do not need to use parentheses when considering the union or the 
intersection of three sets due to part (2) of Theorem 1.5. 


Example 1.18 We illustrate the first equality of part (3) of Theorem 1.5. Let 
А = {3,4,5}, B= {3,5,6,7} and C= {2,3}. 

For the left-hand side, we obtain 
(AN B) UC = {3,5} U {2,3} = {2, 3, 5}, 

and for the right-hand side, we obtain 
(AUC)N (BUC) = {2,3,4,5} П {2,3, 5, 6, 7} = {2,3,5}. 


In both cases we obtain the same set, i.e. (4 N B) UC = (AU C) (BU C). 
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Remark There exist relationships between set operations and logical operations described 
in Chapter 1.1.1. Let us consider the propositions A and B: 


A: aéA; 
B: beP. 
Then: 


(1) conjunction A ^ B corresponds to intersection A’ N B’; 

(2) disjunction A v B corresponds to union 4’ U B’; 

(3) implication 4 > B corresponds to the subset relation (inclusion) A’ C B’; 
(4) equivalence А < B corresponds to set equality A’ = B’. 


The following theorem gives the cardinality of the union and the difference of two sets in the 
case of finite sets. 


THEOREM 1.6 Let A and B be two finite sets. Then: 


(1) Мов = [A| + |B| - |A В; 
Q) |A\B| = 141 — |4 N B| = |A U B| — |B|. 


Example 1.19 A car dealer has sold 350 cars during the quarter of a year. Among them, 
130 cars have air conditioning, 255 cars have power steering and 110 cars have a navigation 
system as extras. Furthermore, 75 cars have both power steering and a navigation system, 
10 cars have all of these three extras, 10 cars have only a navigation system, and 20 cars have 
none of these extras. Denote by A the set of cars with air conditioning, by P the set of cars 
with power steering and by N the set of cars with navigation system. 

Then 


|A| = 130, |P| = 255, IN| = 110, 
INnP| 275, |AnN n P| = 10, [INN (AU P)| = 10. 


Moreover, let C be the set of sold cars. Then 


|C| 2350 and |(AUNUP)c|=20 


I4 U PU N| = 330. 

First, the intersection N N P can be written as the union of two disjoint sets as follows: 
NnP- Len) \4] U [annar], 

Therefore, 


INA P| =|(NOP)\4|+|ANNNP| 
75 = (МПР) \ A| + 10 
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from which we obtain 
IN à P)N 4| = 65, 


i.e. 65 cars have a navigation system and power steering but no air conditioning. Next, we 
determine the number of cars having both navigation system and air conditioning but no 
power steering, i.e. the cardinality of set (№ П A) V P. The set № is the union of disjoint sets 
of cars having only a navigation system, having a navigation system plus one of the other 
extras and the cars having all extras. Therefore, 


IN| = INN(AU P)| + АПМ) \ P] - (ON O P) NA| -]AO N Y P | 
110 2 10+ |(ANN) \ P| 4-65 + 10 

from which we obtain 
(ANN) \ P| = 25, 


i.e. 25 cars have air conditioning and navigation system but no power steering. Next, we 
determine the number of cars having only air conditioning as an extra, i.e. we determine the 
cardinality of set А V (N U P). Using 


INU P| = IN| + IPI- INN P| 
= 110 + 255 — 75 = 290, 


we obtain now from Theorem 1.6 


|4\ (NUP)| = IAU(NU P)| - INUP| 
— 330 — 290 — 40. 


Now we can determine the number of cars having both air conditioning and power steering 
but no navigation system, i.e. the cardinality of set (А П P) V М. Since set A can be written 
as the union of disjoint sets in an analogous way to set N above, we obtain: 


|А| = |4\ (NUP)|+|(ANN)\ PI (40 P) \ м - 4n N P 
130 = 40 4-25 + (4 P) V N| +10 


from which we obtain 
I4 1 P)A N| = 55. 


It remains to determine the number of cars having only power steering as an extra, i.e. the 
cardinality of set P \ (A U N). We get 


IP| = IPN(AUNDI + (AO P) NNI УПР) VA] АПАР 
255 = |Р\ (AU N)| + 55 +65 + 10 


from which we obtain 


IP N (AU N)| = 125. 
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The Venn diagram for the corresponding sets is given in Figure 1.3. 


C (|C| = 350) 


(NAP) \ Al = 65 


IN \ (AU P)| = 10 


(AU NU P)o| = 20 


Figure 1.3 Venn diagram for Example 1.19. 


Example 1.20 Among 30 secretaries of several institutes of a German university, each 
of them speaks at least one of the foreign languages English, French or Russian. There are 
11 secretaries speaking only English as a foreign language, 6 secretaries speaking only French 
and 2 secretaries speaking only Russian. Moreover, it is known that 7 of these secretaries 
speak exactly two of the three languages and that 21 secretaries speak English. 

Denote by E the set of secretaries speaking English, by F the set of secretaries speaking 
French and by R the set of secretaries speaking Russian. Then 


|EUFUR| = 30. 
Moreover, we know the numbers of secretaries speaking only one of these foreign languages: 
Е\(ЕОР)| = 11, |F\(EUR|=6, [R\(EUP)| = 2. (1.2) 
In addition we know that [Е | = 21. For the cardinality of the set Lı of secretaries speaking 
exactly one foreign language we have || = 11+ 6+ 2 = 19. For the cardinality of set L2 


of secretaries speaking exactly two foreign languages, we have to calculate the cardinality 
of the union of the following sets: 


a= [пел [палео [еюл 


E (ЕП Е) \Е| + (Еп ®)\Е| enm z] = 
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Of course, in the last row we can add the cardinality of the three sets since they are pairwise 
disjoint. To find the number of secretaries speaking all three foreign languages, we have 
to subtract from the total number the number of secretaries speaking exactly one foreign 
language and the number of secretaries speaking exactly two foreign languages. We obtain 


JEN FOR = |EUFUR|- |L] — £2] = 30-19 -7 = 4. 


Next, we determine the number of secretaries speaking both French and Russian but not 
English, i.e. we want to determine the cardinality of set (Е N А) V E. Using |E| = 21, we get 


IG o PF)NRI T IGOR)NF| = JE- (ЕПЕПА) О[Е\ FUR] 
= |E] — [IE AF AR| + |E \ FU RI] 
= 21— (4+11) = 6. 
Therefore, 


If n \ E| = Val - (IGn NRI ЕПА \ Fl) =7-6= 1. 


Thus, 1 secretary speaks French and Russian but not English. This solution is illustrated by 
a Venn diagram in Figure 1.4. With the given information it is not possible to determine 
the cardinalities of the sets (E N F) V R and (E П R) \ F. In order to be able to find these 
cardinalities, one must know either the number of secretaries speaking French (i.e. |F|) or the 
number of secretaries speaking Russian (i.e. |R|). Without this information, we only know 
that the sum of the cardinalities of these two sets is equal to six (see Figure 1.4). 


5 


Figure 1.4 Venn diagram for Example 1.20. 


R 
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Definition 1.17 The set of all ordered pairs (a, b) with a € A and b € B is called the 
Cartesian product A x B: 


AxB-—((a,b)|aeA^b se B). 


Example 1.21 LetX = {(x | (1 < x < 4) ^ (x is a real number) and Y = {у | (2 < y < 
3) л (у is a real number)}. Then the Cartesian product X x Y is given by 


ХхҮ = {(х,у) | (1<х < 4) л (2 < у < 3) л (х, у are real numbers)}. 


The Cartesian product can be illustrated in a coordinate system as follows (see Figure 1.5). 
The set of all ordered pairs (x,y) € X x Y are in one-to-one correspondence to the points 
in the xy plane whose first coordinate is equal to x and whose second coordinate is equal to y. 


V, 


њ 
N 
w 
w+ 
сл 


—1- 


Figure 1.5 Cartesian product X x Y in Example 1.21. 


Example 1.22 We prove the following equality: 
(A, U 42) x B= (Ау x В) О (42 x В). 
By logical equivalences we obtain 


(a,b) є (41 U A2) x B& ac А ОА лБ є В 
«= (ає Ај Має А5) л Бє В 
<4 (lac Ау лБ є В) ~ (а є А ^Ь є В) 
<> (a,b) € Ay х BV (a,b) є А х B 
«= (a, b) € (41 х В) U (45 x В). 


We have shown that an ordered pair (a, b) is contained in set (A; U 42) x B if and only if it 
is contained in set (41 x В) О (42 х B). Therefore the corresponding sets are equal. 
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Example 1.23 Let = {2,3} and B = {3, 4, 5}. Then 
Ax B= {(2, 3), (2,4), (2,5), (3, 3), 3,4), (3, 5)} 
and 


B x A = {(3,2), (3,3), (4, 2), (4, 3), (5, 2), (5, 3)}. 


Example 1.23 shows that in general we have A x B 5 B x A. However, as the following 
theorem shows, the cardinalities of both Cartesian products are equal provided that A and В 
are finite sets. 


THEOREM 1.7 Let А,В be finite sets with [4| = n and |B| = m. Then: 
Ах B| = |B x A| = |A|- |B| =n - m. 


Now we generalize the Cartesian product to the case of more than two sets. 


Definition 1.18 . Let А, 45, ..., 4, be sets. The set 


n 
ХА; = Арх А ХО х А, 


i=1 


—((a1,a05,...,a,) |а € Ay ла € А A+++ A ay € An} 


is called the Cartesian product of the sets А1, A2,...,An. 


n 
An element (a1, а2,...,аһ) € ХА; is called ordered n-tuple. 
i=l 


n 
For Ај = 42 =... = A, = A, we also write XA; = A”. 


i=1 
Example 1.24 LetA = {3,4} and B = {10, 12, 14, 16} be given. Then the Cartesian product 
A? x B is defined as follows: 
Ах В = {(x,y,z) | x,y € 4,2 є В). 
In generalization of Theorem 1.7, we obtain |A? x В| = |A|? - |B] = 27-4 = 16. 
Accordingly, if A = {1,2}, B = {b1, b2} and C = {0, 2,4), we get|AxBxC| = |A|-|B|-|C| = 
2.2.3 = 12, i.e. the Cartesian product A x B x C contains 12 elements: 


A x B x С = {(1, 51,0), (1, 51, 2), (1, b1, 4), (1, b2, 0), (1, b2, 2), (1, b2, 4), 
(2, b1, 0), (2, b1, 2), (2, b1, 4), (2, b2, 0), (2, 52, 2), (2, b2, 4)]. 
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1.3 COMBINATORICS 


In this section, we summarize some basics on combinatorics. In particular, we investigate 
two questions: 


(1) How many possibilities exist of sequencing the elements of a set? 
(2) How many possibilities exist of selecting a certain number of elements from a set? 


Let us start with the determination of the number of possible sequences formed with a set of 
elements. To this end, we first introduce the notion of a permutation. 


Definition 1.19 Let M = {a1,a2,...,an}. Any sequence (ap,,ap),...,@p,) of all 
elements of set M is called a permutation. 


In order to determine the number of permutations, we introduce л! (read: n factorial) which 
is defined as follows: n! = 1-2 ·...· (и – 1) -n forn > 1. For n = 0, we define 0! = 1. 


THEOREM 1.8 Letaset M consisting ofn > 1 elements be given. Then there exist P (n) = п! 
permutations. 


The latter theorem can easily be proved by induction. 


Example 1.25 We enumerate all permutations of the elements of set M = {1,2,3}. We 
can form P(3) — 3! — 6 sequences of the elements from the set M : (1,2,3), (1,3,2), 
(2, 1,3), (2, 3, 1), G, 1, 2), G, 2, 1. 


Example 1.26 Assume that 7 jobs have to be processed оп a single machine and that all 
job sequences are feasible. Then there exist P(7) = 7! = 1-2-.... 7 = 5,040 feasible job 
sequences. 


Example 1.27 Given 6 cities, how many possibilities exist of organizing a tour starting 
from one city visiting each of the remaining cities exactly once and to return to the initial 
city? Assume that 1 city is the starting point, all remaining 5 cities can be visited in arbitrary 
order. Therefore, there are 


Р(5) = 5\ = 1-2....-5 = 120 


possible tours. (Here it is assumed that it is a different tour when taking the opposite sequence 
of the cities.) 
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If there are some non-distinguishable elements the number of permutations of such elements 
reduces in comparison with P(n). The following theorem gives the number of permutations 
for the latter case. 


THEOREM 1.9 Let n elements consisting of groups of и], 72,...,n, non-distinguishable 
(identical) elements with n = n, + n2 +--- + n, be given. Then there exist 


n! 
P(n; n,no,...,ny) = е еу 
пі! - по! -... пу 


permutations. 


Example 1.28 How many distinct numbers with nine digits exist which contain three times 
digit 1, two times digit 2 and four times digit 3? Due to Theorem 1.9, there are 


9! 5-6-7-8-9 
ie 6-2 


PO; 3,2,4) = 3 = 1,260 


distinct numbers with these properties. 


Example 1.29 In a school, a teacher wishes to put 13 textbooks of three types 
(mathematics, physics, and chemistry textbooks) on a shelf. How many possibilities exist 
of arranging the 13 books on a shelf when there are 4 copies of a mathematics textbook, 
6 copies of a physics textbook and 3 copies of a chemistry textbook? The problem is to find 
the number of possible permutations with non-distinguishable (copies of the same textbook) 
elements. 

For the problem under consideration, we have n = 13,n, = 4,n2 = 6 and пз = 3. Thus, 
due to Theorem 1.9, there are 


13! 7-8-9-10-11-12-13 


PIS оаа е 4!.3! 


= 60,060 


possibilities of arranging the books on the shelf. 


Next, we investigate the question of how many possibilities exist of selecting a certain 
number of elements from some basic set when the order in which the elements are chosen 
is important. We distinguish the cases where a repeated selection of elements is allowed and 
forbidden, respectively. First, we introduce binomial coefficients and present some properties 
for them. 
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Definition 1.20 For 0 < k < n with k,n being integer, the term 


(o) b n! 
k) Е. (n — k)! 


is called a binomial coefficient (read: from n choose k). For k > n, we define 
n 
ex (0), 
Ü 


For instance, we get 


E = = 210. 
4 


10 10! 1:2.3.4.5.6.7.8.9.10 — 7.8.9.10 
41-6! (1-2-.3-4)-(1-2.3-4-5-6) 1-2-3-4 


In order to determine the binomial coefficient () with n > k, we have to compute the quotient 
of the product of the k largest integers no greater than n, i.e. (n—kK+1)-(n—k+2)-...-n, 
divided by the product of the first k integers (i.e. 1:2 ·... · К). The following theorem gives 
basic properties of binomial coefficients. 


THEOREM 1.10 Let 0 < k < n with k and n being integers. Then: 
(1) аара п\ _ n Y 
0) in) ^ k) \п-К/” 
n n п+ 1 
2 = . 
o (thera) (os) 
The first property in Theorem 1.10 describes a symmetry property of the binomial coefficients 
and the second can be interpreted as an addition theorem for binomial coefficients. Applying 


the binomial coefficient, we get the following theorem: 


THEOREM 1.11 Let a, b be real numbers and л be a natural number. Then: 


(a+ b)" = а + (Deo (ew +...+ ( i jor Lp 
xm 


Remark Using a = b = 1 in Theorem 1.11, we get the following special case: 


»(0-Q«Q (5) а 


The above comment confirms the previous result that the power set of а set having и 
elements has cardinality 2”. The coefficients in Theorem 1.11 can be determined by means 
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of Pascal 5 triangle: 


(a+b)? 1 

(a+b)! 1 1 
(a+b)? 1 2 1 

(a 4- b? 1 3 3 1 


(a + b) 1 4 6 4 1 


G + by" n n nV n n n 
0 1 k k+1)  \n-1 n 
(a+b! ti) fn4l n+1 п+1\ (n+1 
0 1 i k+1 = п п+ 1 
In Pascal’s triangle, each inner number is obtained by adding the two numbers in ће row 
above which are standing immediately to the left and to the right of the number considered 
(see equality (2) in Theorem 1.10). For instance, number 6 in the middle of the row for 
(a + Б)“ is obtained by adding number 3 and number 3 again in the row of (a + b)? (see 
the numbers in bold face in rows four and five of Pascal’s triangle above). Moreover, the 


numbers in each row are symmetric (symmetry property). In the case of n = 2, we obtain 
from Theorem 1.11 and Pascal’s triangle the well-known binomial formula 


(a+ b = а + 2ab +B’. 


Example 1.30 We determine the fifth power of 2x + y. Applying Pascal’s triangle, we 
obtain 


(2x + y? = (2х)? + 5Q3)*y + 10223? + 10(2x)y? + 5(2x)y4 + y? 
= 32x° + 80x4y + 80333? + 40323? + 10xy* + у?. 


In order to get a formula for (a — b)", we have to replace b by —b in Theorem 1.11, which 
leads to a minus sign in each second term (i.e. in each term with an odd power of b). We 
illustrate the computation by the following example. 


Example 1.31 We determine the third power of x — 2y. From Theorem 1.11, we obtain 


3 
&-2»? = [x + 029] 
= x? — 3x? (2y) + 3x(2y)? — (2у)? 
= хў — бх2у + 12x? — 8y?. 


Next, we consider selections of certain elements from a set starting with ordered selections, 
where the order in which the elements are chosen from the set is important. 
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THEOREM 1.12 The number of possible selections of k elements from n elements with 
consideration of the sequence (i.e. the order of the elements), each of them denoted as a 
variation, is equal to 


(1) V(k,n) = if repeated selection is forbidden (1.e. every element may occur 
_ only once in each selection); 
(2) V(k,n) = n* if repeated selection is allowed (i.e. an element may occur arbitrarily 


often in a selection). 


n! 
(n— k)! 


Example 1.32 A travelling salesman who should visit 15 cities exactly once may visit only 
4 cities on one day. How many possibilities exist of forming a 4-city subtour (i.e. the order 
in which the cities are visited is important). 

This is a problem of determining the number of variations when repeated selection is 
forbidden. From Theorem 1.12, we obtain 


15! 15! 
V(4,15) а = > = 12- 13: 14- 15 = 32,760 
(15—4) 11! 


different 4-city subtours. 


Example 1.33 A system of switches consists of 9 elements each of which can be in position 
‘on’ or ‘off’. We determine the number of distinct constellations of the system and obtain 


V(9,2) = 2° = 512. 


Next, we consider the special case when the order in which the elements are chosen is 
unimportant, i.e. we consider selections of unordered elements. 


THEOREM 1.13 The number of possible selections of k elements from n elements with- 
out consideration of the sequence (i.e. the order of elements), each of them denoted as a 
combination, is equal to 


(1) C(k,n) = B if repeated selection of the same element is forbidden; 


zs k—1 
(2) C(k,n) = (" 5 А ) if repeated selection of the same element is allowed. 


Example 1.34 In one of the German lottery games, 6 of 49 numbers are drawn in arbitrary 
order. There exist 


= 13,983,816 


49 49! 44-45. .... 49 
C(6,49) — ( )- 2 = 


6 43 1-2.....6 
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possibilities of drawing 6 numbers if repeated selection is forbidden. If we consider this game 
with repeated selection (1.e. when a number that has been selected must be put back and can 
be drawn again), then there are 


— 25,827,165 


= 54 54! — 49.50.....54 
ce. = ( js Seas 


6 48! 1-2.....6 


possibilities of selecting 6 (not necessarily different) numbers. 


Example 1.35 Ata Faculty of Economics, 16 professors, 22 students and 7 members of the 
technical stafftake part in an election. A commission is to be elected that includes 8 professors, 
4 students and 2 members of the technical staff. How many ways exist of constituting the 
commission? 

There are 


— 12,870 


16 16! 9-10-11-12-13-14-15- 16 
cass = ( ) 


8/ 8.8 — 1:2.3.4.5.6. 7.8 


ways of electing 8 out of 16 professors, 


22 22! 19-20-21-22 
соз, = (7) = zs alana очы, СО? 


ways of electing 4 out of 22 students and 


7 7! 6.7 
сазу=(„}|=згзт= = 


ways of electing 2 out of 7 members of the technical staff. Since any selection ofthe professors 
can be combined with any selection of the students and the technical staff, there are 


C(18, 8) - C(22, 4) - C(7,2) = 12,870. 7,315.21 = 1,977, 025, 050 


ways of constituting such a commission. 


Example 1.36 From a delivery containing 28 spare parts, 5 of which are faulty, a sample 
of 8 parts is taken. How many samples that contain at least one faulty part are possible? From 
Theorem 1.13, the number of samples with at least one faulty part is given by 


28 23 
C(28, 8) — C(23, 8) = b C — 3,108,105 — 490,314 — 2,617,791. 


Here we have subtracted the number of samples without any faulty spare part from the total 
number of possible samples. 
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14 REAL NUMBERS AND COMPLEX NUMBERS 


This section deals with number systems. In the first subsection, we summarize the most 
important facts and rules concerning numbers. Then we extend the set of real numbers by 
introducing so-called complex numbers. 


1.4.1 Real numbers 


The infinite set N = {1, 2,3, . . .} is called the set of natural numbers. If we union this set with 
number zero, we obtain the set № = N U {0}. The natural numbers can be represented on a 
straight line known as a number line (see Figure 1.6). 


Figure 1.6 Natural numbers on the number line. 


We can perform the operations of addition and multiplication within the set N of natural 
numbers, i.e. for a, b € N, we get that a + b belongs to set N and a - b belongs to set N. 
In other words, the sum and product of any two natural numbers are again a natural number. 
However, the difference and quotient of two natural numbers are not necessarily a natural 
number. 


The first extension which we perform is to add the set of negative integers (—1, —2, —3,...] 
to set No which yields the set Z of integers, i.e. 


Z-NoU(—-1,—2,—3,...]). 


This allows us to perform the three operations of addition, subtraction and multiplication 
within the set of integers, i.e. for a, b € Z, we get that a + b belongs to set Z, a — b belongs 
to set Z and a - b belongs to set Z. 


To be able to perform the division of two integers, we introduce the set of all fractions p/q 
with p € Z,q € N. The union of the integers and the fractions is denoted as set Q of rational 
numbers, i.e. 


@=2u{? ре, aen]. 
q 


Now all the four basic operations of addition, subtraction, multiplication and division (except 
by zero) can be performed within the set Q of rational numbers. 


Consider next the equation x? — 2. This equation cannot be solved within the set of rational 
numbers, i.e. there exists no rational number p/q such that (p/q)? — 2. This leads to the 
extension of the set Q of rational numbers by the irrational numbers. These are numbers 
which cannot be written as the quotient of two integers. There are infinitely many irrational 
numbers, e.g. 


М2 2 1.41421, V3 ~ 1.73205, e~ 2.71828 and л © 3.14159. 
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Irrational numbers are characterized by decimal expansions that never end and by the fact 
that their digits have no repeating pattern, i.e. no irrational number can be presented as a 
periodic decimal number. 


The union of the set Q of rational numbers and the set of all irrational numbers is denoted as 
set R of real numbers. We get the following property: there is a one-to-one correspondence 
between real numbers and points on the number line, i.e. any real number corresponds to a 
point on the number line and vice versa. 


The stepwise extension of the set N of natural numbers to the set R of real numbers is 
illustrated in Figure 1.7. Next, we summarize some basic rules for working with the set of 
real numbers. 


Properties of real numbers (a, b,c € R) 


(1 a+b=b+a (commutative law of addition); 
(2) there exists a number, 0 є К, such that for all a 


а+0= 0 +а= а; 
(3) for all a, b there exists a number, х € К, with 
а+х=х+а =; 


(4) а+(Ь+с) = (at b) + с (associative law of addition); 
(5) a-b=b-a (commutative law of multiplication); 
(6) there exists a number, 1 є К, such that for all a 


а:-1=1:а= а; 
(7) for all a, b with a 5 0 there exists a real number, х є К, such that 
а:х=х:а =; 


(8) (a-b)-c=a-(b-c) (associative law of multiplication); 
(9 (a+b)-c=a-c+b-c (distributive law). 


For special subsets of real numbers, we also use the interval notation. In particular, we use 


[a,b] = {xE R |a <x € b; 
(a,b) = {xe R|a<x <}; 


[a,b) = (x Ee R|a<x <}; 


(a,b] = {x €R|a<x< dD}. 


The interval [a,b] is called a closed interval whereas (a,b) is called an open interval. 
Accordingly, the intervals (a, b] and [a, b) are called half-open (left-open and right-open, 
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Real numbers R 


Rational numbers Q Irrational numbers 


Integers Z Fractions 


Negative integers 
Natural numbers N 


and number 0 


Figure 1.7 Real numbers and subsets. 


respectively) intervals. The set R of real numbers can also be described by the interval 
(—оо, оо). The set of all non-negative real numbers is denoted as R+, i.e. R+ = [0, оо). 


Example 1.37 Often a sum or difference T of different mathematical terms is given and 
we wish to find a product representation 1f possible. Let for instance the equality 


T = 2ax — 12ay + 6bx — 36by 

be given. Factoring out 2a in the first two terms and 6b in the third and fourth terms, we get 
T = 2а. (x — бу) + 6b - (x — бу). 

Now, factoring out x — 6y, we get 


T = (x бу): Qa + 6b) 
—2.(x — 6y) - (a + 3b). 


We now summarize some well-known rules for working with fractions. 
Rules for fractions (a,c € R, b,d € R \ {0}) 
a с ad + Бс . 


DP X WE 
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a c ac 
(2) hoa dup 
a c ad 


For a fraction, we can also write a/b (in particular we use the latter form within the text) or 
a: b. According to rule (1), fractions have to have the same denominator before adding or 
subtracting them. The product of two fractions is obtained by multiplying both numerators 
and both denominators (rule (2)). The quotient of two fractions is obtained by multiplying the 
fraction in the numerator by the reciprocal value of the fraction in the denominator (rule (3)). 
As an application of these rules, we consider the following examples. 


Example 1.38 We compute 


17 5 7 


Z= 45 10 15 


Applying rule (2), we have to find a common denominator for all fractions. Instead of taking 
the product 45-10-15 = 6,750, we can use the smallest common multiple of 45, 10 and 15, 
which is equal to 90, and as a result we obtain 


17.2—5.9-- 7.6 34—45+42 31 
Z--— = — 2 
90 90 90 


Example 1.39 Let 


a x 


a—x atx _ 
a x 


a+x a—x 


provided that x Æ a and x Æ —a. The latter conditions are sufficient to guarantee that no 
denominator in the above equation is equal to zero. Transforming the left-hand side, we get 


a(a 4- x) — x(a — x) 


(a — x)(a + x) _ а2+ах—ах+х? _ 
а(а= х)+х(а+х а -ах+фах+х2 _ 
(a + x)(a — x) 


from which we obtain x = 1 as the only solution (if a = 1 anda = — 1). 


Example 1.40 Assume that the fuel consumption of a car is 6.5 1 per 100 km. How many 
kilometres can the car go if the tank of this car is filled up with 41 1? 
One can solve this problem after establishing a proportion: 


6.5 : 100 = 41 : x, 
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i.e. if 6.5 | are consumed per 100 km, 41 | are consumed per x km. After cross-multiplying, 
we get 


6.5x = 41 · 100 
from which we obtain 
x 7 630.77 km, 


i.e. the car can go (approximately) 630 km with 41 1. 


In the following, we survey some rules for working with inequalities. 
Rules for inequalities (a,b,c,d € R) 


(1) Ifa <b, thena+c<b+canda—c<b-—c; 
(2) ifa < bandb < с, thena < c; 
(3) ifa < b and c > 0, then 


E 


B 
с 


ас < be, = < 
с 
(4) ifa < b and c < 0, then 


E 


P 
с 


а 
ас > Бе, -> 
c 


(5) ifa < bande <d,thena+c<b+d; 
(6) if0 <a < b, then 


(7 ifa? < band b > 0, then 


wm 


a>—Vb ^ a<vb (or correspondingly, — Vb < a x Nb). 


Rule (1) indicates that we can add or subtract any number to an existing inequality without 
changing the inequality sign. Rule (2) gives a transitivity property (i.e. one can ‘connect’ 
several inequalities with the same inequality sign). Rule (3) says that the inequality sign does 
not change when multiplying (or dividing) both sides of an inequality by a positive number, 
but the inequality sign changes when multiplying (or dividing) both sides by a negative 
number (see rule (4)). In particular, if c = —1, then it follows from a < b that —a > —b. 
Rule (5) expresses that one can add inequalities with the same inequality sign. However, 
we cannot state a corresponding rule for the subtraction of two inequalities with the same 
inequality sign. Rule (6) states that, if we consider the reciprocal values of two positive 
numbers, the inequality sign is changed. Finally, rule (7) is helpful for solving quadratic 
inequalities. 
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Example 1.41 We determine all real numbers satisfying the inequality 
2x—4x 5x-F 1. 


Putting all terms depending on x on the left-hand side and the remaining terms on the right- 
hand side, we get 


2x—5x € 1-4 
which corresponds to 
—3x <5. 


After dividing both sides by —3 (i.e. the inequality sign changes), we obtain 


x= 


Wl CA 


Thus, the set of all real numbers from the interval [5/3, oo) satisfies the given inequality. 


Definition 1.21 Leta є R. Then 


lal a fora > 0 
21| = 
—a fora<0 


is called the absolute value of a. 


From Definition 1.21 it follows that the absolute value |a| is always a non-negative number. 
(Note that, if a < 0, then —a > 0.) The absolute value of a real number represents the 
distance of this number from point zero on the number line. For instance, |3| = 3, |- 5| = 5 
and |0| = 0. The following theorem gives some properties of absolute values. 


THEOREM 1.14 Leta,b є R and c € Ку. Then: 


(D |- al = ld 

(2) lal Sc =c Saxe; 

(3) |a] > c & (a«—o) v (a> с); 
(4) | lal — [|| < la bl < lal + Ibl; 
(5) la-| = 141-11 


Rule (2) expresses that, for instance, we can rewrite inequality 


x| < 3 
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in an equivalent form as follows: 
—3<x<3. 

Similarly, inequality |y| > 6 can be written in an equivalent form as follows: 
y«-—6 Or у> 6. 


The inequalities of part (4) are known as triangle inequalities, referring to the geometric pro- 
perties of a triangle (see also Chapter 6): the length of the third side of a triangle (i.e. |a + b|) 
is always no greater than the sum of the lengths of the other two sides (i.e. |а| + |b|) and is 
always no smaller than the absolute value of the difference of the lengths of the other two 
sides. 


Often, inequalities contain absolute values of certain mathematical terms, and the problem 
is to find the solution set satisfying this inequality. In such situations, usually certain cases 
have to be considered. Let us consider the following examples. 


Example 1.42 Let the inequality 
|x — 3] <5 


be given. In order to find the set of all solutions satisfying the above inequality, we have to 
distinguish the following two cases. 


Case 1 x—3 > 0 This means that x > 3. Then |x — 3| = x — 3, and we obtain from 
x—3 < 5 the set of solutions Г in case 1: 


Ly-ixeR|I(xz3)^(xz8)-— [3,8]. 


Case2 х—3 < 0 This means that x < 3. Then |x — 3| = —(x — 3) = —x + 3, and we 
obtain from —x + 3 < 5 the set of solutions Г in case 2: 


L = {x € R | œ < 3) A (x > —2)} = [—2,3). 


In order to get the set L of all solutions satisfying the given inequality, we have to find the 
union of both sets / and L5: 


L= Li UL) = {x € R | x € [-2,3) U [3, 8]} = [-2, 8]. 


Example 1.43 Let inequality 


|2x — 2| 
x+3 


<1 (1.3) 


be given. We determine the set of all real numbers satisfying this inequality. In order to solve 
it for x, we have to remove the absolute values and the fractions. However, the transformation 
depends on whether the term 2x — 2 is positive or negative and whether the denominator is 
positive or negative. Therefore, we determine the roots of both terms. From 2x — 2 = 0, 
it follows that x = 1 and from x + 3 = 0, it follows that x = —3. Therefore, we have to 
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consider the following three cases (where each case leads to a different transformation of 
inequality (1.3), note also that x = —3 has to be excluded while x = 1 is allowed): case 1: 
x < —3; case 2: —3 <x < 1; case 3: x > 1. 


Case 1 Ifx < —3, inequality (1.3) turns into 


2 — 2x 
x+3 


«l1 


since |2x — 2| = 2 — 2x due to inequality 2x — 2 < 0 and Definition 1.21. Multiplying both 
sides by the negative number x 4- 3, we obtain 


2—2x x43 


and therefore, 3x < —1, which corresponds to inequality x < —1/3. Hence, in case 1 we 
have the following set Lı of solutions: 


L= [regie <-3)A (x <-3)| = eni < -3) = Cos -. 


Case2 If—3 « x < 1, inequality (1.3) reads as in case 1: 


2—2x 


« ]. 
x43 


However, since x + 3 > 0 in case 2, multiplying both sides by the positive number x + 3 
gives 


2—2x «x43 


which yields 3x > —1, and thus after dividing by 3 we get 
1 


xe. 


3 


Thus, in case 2 we get the following set L2 of solutions: 


а= ences «хл (x> -5)]-(- 


Case3 Ifx > 1, then |2x — 2| = 2x — 2 and we have 


r1) 
Ll]. 
3 


2x—2 


< 1. 
x+3 


Multiplying both sides by the positive number x + 3, we get 
2x—2 «x43 
and therefore x < 5. The set L4 of solutions is as follows: 


1з={хєК|(х>1)л (х < 5)} = [1, 5). 
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The complete set Z of solutions of inequality (1.3) is obtained as the union of the three sets 
Li, La and Із, 1.е. 


1 
1= 0000 = |> R |х € (оо, »v( JI 


Absolute values are, for instance, helpful when solving quadratic inequalities. 


Example 1.44 We determine all real numbers satisfying the inequality 
—3х? + 18x — 15 > 0. 
First, we divide both sides by the negative number —3 which changes the sign in the inequality: 
x? —6x+5<0. (1.4) 
One way to solve this inequality is to consider the corresponding equation 
x —6x+5=0 
and determine its roots: 
xj23449—5-25 and x. =3-J9—-5=1. 


Now we check for an arbitrary ‘trial’ value (e.g. number 0) whether the given inequality is 
satisfied. We obtain 


07- 6-045 £0, 


and therefore all x € (—оо, 1) do not satisfy the given inequality (for all these values, we 
have x? — 6x -- 5 > 0). Since we have obtained two distinct roots, the sign of the term 
x? — бх + 5 changes ‘at’ the roots x; and x2, and therefore the set of solutions L is given by 
L = [1,5]. (Note that for x > 5, the sign changes again so that the given inequality is not 
satisfied for these values.) 

An alternative way to solve inequality (1.4) is as follows. We rewrite the left-hand side using 
a complete square in the form x? — 2a + a? = (x — a)? which gives 


x — 6x + 9-4= (х – 3)2 – 4 < 0, 
or equivalently, 
(х — 3)? < 4. 
If we now take the root on both sides, we get the two inequalities 


х-3 > –2 and х-3 < 2. 
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Using the definition of the absolute value, we can rewrite the latter condition as 
Ix-3| <2 


which corresponds to the set of solutions L = [1, 5]. 


Let a” = b with a, b € R, and n € N, i.e. b is the nth power of number a. Then a = Vb is 
referred to as the nth root of number b. In the following, we review the most important rules 
for working with powers and roots. 


Rules for powers and roots 
Powers (a, b, т, п € №): 


(1) a” .g^ = qnn. 
а" 


(2) а". b" = (aby"; 
(3) £- =a" (a #0); 
a 
a” ann 
o 5=(5) (b #0); 
(5) а= 2 (a #0); 
a 


(6) (а")" = a" = (а")"; 


Roots (a,b € Ку, тє 2,п є №): 
(1) Va-b= уа. Vb; 


(2) % = WI (b # 0); 
as #0) 
(3) Уа" = aml" (a4 0 or m/n > 0). 
From the latter equality, we get for m — 1 the special case 


Ya = al^, 


Example 1.45 We consider the root equation 
Мх + мх +3 = ух + 8 
and determine all solutions x. Taking the square on both sides, we obtain 


(Мх + x T3 = x48 
х+2- ух: мх+3 + (х +3) =х +8, 
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which can be written as 
2. x: Nx 4-3 = —x 4 5. 

Taking now again the square on both sides, we obtain 
4х(х + 3) =x? — 10x + 25 

which is equivalent to the quadratic equation 
3х2 + 22x – 25 = 0 


or, equivalently, 


x12 = Р? + (2) -« 
1,2 27 


provided that (p/2)? — q > 0. Hereafter, the sign + means + or —. Hence, the two solutions 
of the equation obtained are 


11 121 25 11 14 
X] = + + = + = 1 


and 


11 101. 25 11 14 25 
3 9 3 3 3 3: 


x2 


In the case of root equations, we have to verify that the values obtained are indeed a solution 
of the original equation. For x; = 1, the root equation is satisfied. However, for x = —25/3, 
A/x2 is not defined since x2 is negative. Therefore, x; = 1 is the only real solution of the 
given root equation. 


Example 1.46 Ifthe denominator of a fraction contains roots, it is often desirable to trans- 
form the fraction such that the denominator becomes rational. We illustrate this process of 
rationalizing the denominator. Let 


a 


2 = : 
xy 


In this case, we multiply both the numerator and the denominator by x — ,/y, and we obtain 


ax- P) ак-ак y») 


2 = = 
CEDE- xb-CGQyY? х? – у 
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Example 1.47 We compute 


x= у a. Ya (a > 0) 
and obtain 


x= Va .a2/5 = ў а7/5 = q1/65) — 47120 X a’. 


Example 1.48 Let the exponential equation 
2€ 7 —e*-1=0 


be given. In order to determine all real solutions x of this equation, we can transform it into 
a quadratic equation by using the substitution y = e *. Taking into account that 


e? = ( ex) 
we get the equation 
232 -у-1=0 


which сар be rewritten as 


The latter equation has the two solutions 


xard aN ips dd. 
Tec ТТ 2 4 d pog 


and 


ENSEM 0 NN 
2=4 VIG. 121—474 19 4 4 2 


Substituting variables back, this gives 


1 
е = 1 апа e = – с, 
2 
From the first equality we get the solution ху = — In 1 = 0 while the second equality does 


not have a solution since e 2 is always greater than zero. Therefore, x} = 0 is the only 
solution of the given exponential equation. 
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Example 1.49 We consider the equation 
xt + 9(a? 4x2) = —а?х? 

and determine for which values of parameter a real solutions exist. We obtain 
x* + 9a? + 9x? ax? = x4 + (9+ a’)x? + 9a? = 0. 


Substituting z = x? 


, we get a quadratic equation in the variable z: 
22 + (9 + а?) + 9а? = 0. 


This equation has the two solutions 


9 + а? (9 + a?)? 
212 = + 9а? 
? 2 4 


9-g^, |81 + 18a? + at — 36a? 
2 4 


94-2  |O-@} 
Bo 4 


9+а2 9-а? 
2 2 7 


From the last equation we obtain 


гу = —9 and 22 = —а?. 
For the first solution in z, we get after back substitution x? = —9, i.e. there is no real solution 
of this equation. Considering the second solution and substituting back, we get x? = —a? 


which does not have a real solution for a Æ 0. Thus, only for a = 0 do we have x? = 0, i.e. 
in this case we get the solutions x, = х2 = 0. 


Definition 1.22 Let a* = b with a, > 0 anda Æ 1. Then 
x = log, b 


is defined as the /ogarithm of (number) Б to the base а. 


Thus, the logarithm of b to the base a is the power to which one must raise a to yield b. As a 
consequence from Definition 1.22, we have 


Bab — b. 
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Rules for logarithms (а > 0, а +1, х > 0, y > 0,n € R) 


(1) log, 1 = 0; log,a=1; 
(2) log,(x-y) = log, x + log, y; 


x 
(3) log, E) = log, x a log, y; 


(4) log, (x") = n-log, x. 


It is worth noting that there is no rule for the logarithm of the sum or difference of two 
numbers x and y. When using a pocket calculator, it is often only possible to use one of the 
bases e or 10. In the case of bases other than e and 10, it is best to transform the base of 
logarithms to one of the above bases. Therefore, we now derive a formula which presents a 
relationship between logarithms to different bases. Let 


x — log,t and y — log, t. 
Solving both equations for t, we get 
а =і= Ь. 


Taking now the logarithm to base a and applying the rules for logarithms presented above, 
we obtain 


log, (а*) = x-log,a =x = log, (P") = y log, b. 
Therefore, 


x  log,t 
log, b = - = —*. 
mmy log, t 


Considering now the left-hand side and the right-hand side from the above equality, we get 
the change-of-base formula 


log, t 
log, b` 


log, = 


As an example, if we want to determine log, 11, we can apply the above formula with base 
a = e and obtain 


log, 11 2.39789527... 


Ai = 1.4899, 
log, 5 1.60943791... 


log; 11 = 


where both logarithms on the right-hand side can be determined using a pocket calculator. 
The dots in the numerator and denominator of the last fraction indicate that we dropped 
the remaining decimal places, and the final result on the right-hand side is rounded to four 
decimal places. The same result is obtained when taking e.g. base 10, i.e. 


1оё10 11 __ 1.04139268... 


N = 1.4899. 
logio 5 0.69897000... 


log; 11 = 
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For the special cases of base e and 10, respectively, we use in the following the abbreviations 
log, b = Inb andlog;, b = lgb. 


To illustrate the application of the above logarithm rules, we consider some examples. 


Example 1.50 Let the logarithmic equation 
log; [3 + leg + 0| 22 
be given, and we solve it for x. Applying Definition 1.22 repeatedly, we get 


3-rlog4(x +1) = 22 = 4 
10984(х+ 1) = 1 
х+1=4!=4 


х= 3 


i.e. х = 3 is the only solution of ће given logarithmic equation. 


Example 1.51 Let 


а" * 4 
log, E :logy(y : a?) 


Z log, a 5 aril, >h V> 


be given. We simplify this logarithmic term as much as possible and obtain 


| leg, (а?) — log, y | . | log,» + log,(a*) | 
log, a 


log, y à: x log, a 
loga log, a 


lo 
= («= 10g, ( E ex) 
log, a 


А 


= (x — log, y) ? ( 


= (х — log, y) - (log, y + x) 


=x – (1,5). 


Example 1.52 We simplify the term 


m 
z= logs =» 
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by applying rules for logarithms as follows: 


1 | 1 
Z=-7:10 ER 
3 8x xly 


1 
= = (log, 1 — 2 log, x — log, y) 


1 
= 5: (0-2-1—Iog,y) 


чө 


= -; . (2 + log,y). 


Example 1.53 We consider the exponential equation 


o-o” 


and solve it for x. Such an equation is called an exponential equation since the variable x 
occurs only in the exponent. Taking the logarithm to base 10 (alternatively we can also choose 
any other positive base) on both sides, we obtain 


4 5 
(x — 2) lg (5) = (2x — 3) lg (5) 


which can be transformed further as follows: 


xlg (3) - 2x18 (5) = 216 (3) ~ 31g G) 
21 (3) ~ 31865) 
ts (3) - 216) 


x © 1.58963. 


X — 


1.4.3 Complex numbers 


The set IR of real numbers is still not sufficient for all purposes. For instance, consider the 
equation x? + 1 = 0 or the equation x? = —9 obtained in Example 1.49. These equations do 
not have a real solution, and so we wish to extend the set of real numbers such that all roots 
can be determined within an extended set of numbers. 


Let i be the symbol (‘number’) such that i? = —1 and we call i the imaginary unit. Then we 
can define the set of complex numbers as follows. 


Definition 1.23 The complex number system C is the set of all symbols z = a + bi, 
where a and b are real numbers. The number a = (z) is called the real part and 
b = Sz) is called the imaginary part of the complex number z. 
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The representation z = a + bi is called the Cartesian form of the complex number z. The 
number bi is also known as an imaginary number. The set of complex numbers contains the 
set of all real numbers: if b = 3(z) is equal to zero, we get the real number a + 0i = a. 
The number 2 = a — bi is the complex conjugate of number 2 = a + bi. We are now able to 
solve any quadratic equation within the set of complex numbers. 


Example 1.54 Let x? = —9, Because х2 = 9 - i? we obtain x; = 3i and x; = —3i. 


Example 1.55 We consider the quadratic equation 
х? +4x+13=0 


and obtain 


xı = -2 + м4 13 = —2 + vV —9 = —2 + 3i 


as well as 


x2 = —2 — v4 — 13 = —2 — V —9 = —2 – 3i. 


In the latter representations, we assumed that i = 4/ — 1 by definition. 


Definition 1.24 Let zı = a+ bi and z2 = c + di. The respective operations of 
addition, subtraction, multiplication and division are defined as follows: 
21 + 22 = (a+ bi) + (c + di) = (a + c) + (b + di, 
гү — Z2 = (a+ bi) — (c+ di) = (a — с) + (b — d)i, 
21:22 = (a + bi) - (c + di) = (ac — bd) + (ad + bc)i, 
zı (a+ bi)-(c—di) — (ac- bd) + (bc — ad)i 
z  (c- di)- (c — di) с2 + d? ` 


When computing the quotient 21 /22, the denominator is transformed into a real number 
by multiplying both the numerator and the denominator by 22 = c — di. From the above 
definitions, we see that the sum, difference, product and quotient of two complex numbers 
again yield a complex number. 


For the power of a complex number we obtain 


z” = (a + bi)” = (a + bi) - (a + bi)-...- (a+ bi) 
_ EE Maamaa 
n factors 


For the above computations we can apply Theorem 1.11. However, for large numbers n a 
more convenient calculation of a power is discussed later. 
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Example 1.56 Letz; = —2 + 2i and z2 = 3 + i. Then: 


2 +25 =(-2+3)4+ 2 + 1) = 1 + 3i, 
21 — 22 = (—2 – 3) + (2 — l)i = –5 + i, 
21:22 = (—6 — 2) + (—2 + 6)i = —8 + 41, 
zy —2+2i (—-2+290-(3—) —4+8i 2 


4 
= = = = + =i. 
Z2 34i (3--D-(3- i) 10 5 5 


We can graph complex numbers in the plane by the so-called Argand diagram (see Figure 1.8). 
The x axis represents the real part of the complex number z, and the y axis represents the 
imaginary number (i.e. the imaginary part together with the imaginary unit). Then each 
point in the plane represents a complex number and vice versa. The points on the real axis 
correspond to the real numbers. 


From Figure 1.8, we can derive another representation of a complex number z. Instead of 
giving the real part a and the imaginary part b, we can characterize a complex number by 
means of the angle ф between the real axis and the line connecting the origin and point 
2 = а + bi as well as the length of this line. We get 


a =r coso and b=rsing. 


Imaginary axis 
z—adbi 


bi 


-2 -1 2 a Real axis 


—i 


bi 


Figure 1.8 Argand diagram. 


Therefore, we have 
z=a+ bi = ғ: (coso + isin q). 


The right-hand side representation of number z is called the polar (or trigonometric) form 
of the complex number z. The number 


r= |z| = уа? + b? 
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is denoted as the modulus and ¢ is denoted as the argument of the complex number z. The 
argument ¢ is given as an angle either in degrees or in radians. 


Relationships between polar and Cartesian coordinates 


Letz = a + bi =r - (coso + ising). Then 


(1) a =r cosọ, b =r sing; 


(2) r= |z| = Va +b, PE 
a 


А а А b 
(or equivalently соѕф = – and sing = 2. 
Р r 


Part (1) of the above relationship can be used for transforming a complex number given in the 
polar form into the Cartesian form whereas part (2) can be used for transforming a complex 
number from the Cartesian form into the polar form. We give two examples for transforming 
a complex number from one form into the other. 


Example 1.57 Let z = —1 + 4/31 be given in Cartesian form. To get the number z in polar 
form, we determine the modulus r and the argument ф of the complex number z. We obtain 


r= || 2-02 + (V3)? = V4 =2 
and 


1 b 1 
cosg = Ê es: sing = – = Мз 
r 2 


r 2 
: sing b 
or equivalently, tang = —— = – = —М3 |. 
cosp а 
For 0° < ф < 360°, the equation cosy = —1/2 has the two solutions фу = 120? and 


фә = 240°. Analogously, equation sing = 4/3/2 has the two solutions фз = 60° and 
фа = 120°. Since both equations must be satisfied, we have found р = 120°. This result can 
also be obtained without considering the sine function. The second solution ф2 = 240° of 
the cosine equation cannot be the argument of number 2, since a = —1 < 0 and b = v3 > 0 
(i.e. number z is in the second orthant, which means that 90° < o < 180°). If we operate 
with the tangent function, we obtain 


1 1 
prd. ale 
ano 73 


where two solutions exist: фу = 120? and ф› = 300°. However, since we must have 
ф € [90°, 180°] because a < 0 and b > 0, we would also obtain x = 120° in this case. We 
have found number z in polar form: 


z = 2. (cos 120? + isin 120°). 


Introduction 51 


Example 1.58 Let = 2 - (cos 225° + i sin 225°). Since 
1 ; 1 
cos 225° = = %2 and sin 225° = -5V2 


we get the number z in Cartesian form as follows: 


г ( v2 E СМ2—/21, 


2 2 


i.e. a = b = —\/2. 


The above transformations require some knowledge about trigonometric functions. Some 
basic properties of such functions are summarized in Chapter 3.3 when dealing in detail with 
functions of one real variable. 


We summarize some rules for operations with complex numbers in polar form. 
Rules for complex numbers in polar form 


Let zı = r1- (cos фу + ising) and z2 = r2 · (cos фэ + isin фо). Then 


(1) z 22 = r1 r2: | cos (yi + e) + isin (yi + 92) |; 
21 T] aoe 

(2) “=”. | cos (ру - p2) + isin (f1 - en |. 
Z2 r2 


We multiply two complex numbers in polar form by multiplying their moduli and adding 
their arguments. Similarly, we divide two complex numbers in polar form by dividing their 
moduli and subtracting their arguments. For performing the addition or subtraction of two 
complex numbers, the Cartesian form is required. Next, we derive a formula for the power 
of a complex number in polar form. 


тү = ro; = 1 and фу = 9» = ọ, then the above formula for 21 - z2 reads as 
. 2 . 
(cos + ising) = соз 20 + isin 20. 
By induction, we can obtain de Moivre 5 formula: 


(cos o +i sing)” = cosnqo + i sin nọ, n —1,2,3,... 
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Using de Moivre's formula, we can easily compute powers of complex numbers. Let z — 


r - (cos o + isin ọ) be given in polar form. Then 


n 


n 
ges z (cos + ising)| =r". ( cosng + isin ng). 


Example 1.59 We determine z = (1 — i)^. We transform w = 1 — i into the polar form and 
obtain 


1 1 1 1 
|w|2V12--(—-1? 242 and cosp = — = 202, зшр=———=——\/2. 
42. 2 ~ 2 


Since the cosine value is positive and ће sine value is negative, the argument ¢ is in the 
interval [270°, 360°]. From cos 45° = cos(360°—45°) = /2/2, we obtaing = 360°—45° = 
315°. This gives us 


4 
z= |v2- (cos 315° + isin 315°) | 
= (/2)* . (cos 1, 260° + isin 1, 260°). 
Since 


cosx = cos(x + k - 360°) = cos(x --2kz) and 
sinx = sin(x + k - 360°) = sin(x + 2kz) 


for k € Z, we obtain 1, 260° — 3 - 360° = 180° and therefore 
z =4- (cos 180? + isin 180°) = 4. (—1 + i - 0) = —4. 

The same result is obtained when using the Cartesian form directly: 
т=(1—0%=1—4 + —4 p = —4. 


The latter is obtained due to i? = —i and i^ = 1. 


Definition 1.25  Letzo є Candn є N. Every complex number satisfying the equation 
z” = zo is called the nth root of zo, and we write z = 1/20. 


To derive a formula to determine all the roots of the above equation 2" = 20, let 


z=r-(cosg+ising) and Zo = ro (cos фо + i sin Qo). 
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Then 2” = 20 can be written as follows: 


z” =r". (cosng + isin ng) = ro · (cos qo + isin qo) = Zo. (1.5) 


In equation (1.5), we can compare the moduli and the arguments of both complex numbers, 
and we obtain 


к” — rn and пф = фо + 2kn, К € Z. 
Notice that the right equation concerning the arguments is valid due to the periodicity of the 
trigonometric functions, i.e. sinx = sin(x + 2kz), cosx = cos(x + 2kz), where k є Z. 


The above equalities for finding r and ф can be rewritten as 


фо + 2kz 
диш 


r= Aro and 


Thus, the equation z” = д0 = rg - (cos qo + i sin go) has the n solutions 


, k=0,1,2,...,.2-1 (L6 


+ 2kr : + 2kz 
Zip] = Vro: (cos eee +i sin Ое. ) 
п п 


For k > n, we would again get consecutively the solutions obtained for k = 0,1,2,..., 
n — 1. Therefore, we have exactly n roots of the equation z” = zo. Writing the argument of 
the complex numbers in degrees instead of radians, equation (1.6) can be written as follows: 


k - 360° k - 360° 
у. (cos AE |, sin MEE) k —0,12,...,n— 1. 
n n 


Zk+1 = 


Example 1.60 We determine all roots of z+ = 1. Writing z = 1 = 1 + 0i in polar form 
yields 


z^ = cos 0° + i sin 0°. 


Applying the formula, we obtain 


zı = 1-(cos0° +isin0°) = 1+0 = 1, 


360° 360° 
ZINC 1 +isin 4 ) = 00590? + sino" = 0-4 1i =i, 


720° 720° 
23 = (cos ri + isin 4 ) = 00s 180° + isin 180° = —1 +0i =1, 


1, 080° . 1,080° | : 
ZINC á + isin 4 ) =ов270° + isin270° = 0 - 1i = =i. 
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The four solutions are illustrated in an Argand diagram in Figure 1.9. They are located on a 
circle with the origin as centre and radius one. 


Imaginary 
axis 4 
ij 22 
23 21 
= 
-1 1 Real axis 
— 1 Z 4 


Figure 1.9 Roots 2] — 24 in Example 1.59. 


Example 1.61 Let the equation 


3 1-i 1+i 
т + E 


ETE E 


be given and we determine all solutions z. We obtain 


4, ü-i-ü-i? 1-42i-1-1-42i-41 | 
i = = 2i. 
1—? 2 


The modulus of z? is therefore equal to two and the argument о is equal to 90? (since 2i is 
on the positive section of the imaginary axis). Thus, we have 


z? = 2. (cos 90? + isin 90°). 


Now, we obtain the three solutions 


90° 90° 1 1 
z eal Ds (cos у + isin) = A/2 - (cos 30° + isin 30°) = /2- "E D 


90° + 360° 90° + 360° 
n= 42. c T + isin V.) ED (cos 150? + isin 150°) 
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90° + 720° 90° + 720° 
з = 2s c T + isin ae SNT (cos 270° + isin 270°) 
= 4⁄2. (0 -D = —V2i. 


The three solutions are illustrated in an Argand diagram in Figure 1.10. 


Imaginary 
axis 4 
МЕД 
22 zi 
- 
V2 Real axis 
23 


Figure 1.10 Roots 2] — 23 іп Example 1.61. 


The following representation for an exponential term with imaginary argument is called 
Euler 5 formula, where angle ¢ is presented in radians: 


e? = cosy +i sing 
Using Euler’s formula, we obtain the equality 
z—r.e =r. (coso +i sing). 


From the latter equality, we can introduce the exponential form as the third representation of 
a complex number: 


z-—r.e", 


which is immediately obtained from the polar form, since it uses the same two variables r 
and o. (Again angle 9 is presented in radians.) 


We can summarize the following rules for the product and the quotient of two complex 
numbers as well as for the power of a complex number using the exponential form. 


56 Introduction 
Rules for complex numbers in exponential form 
Let zı = rı · (cos gı + isin g1), 22 = r2 · (cos фо + ising), 2 —r- (coso + ising) and 
Zo = ко. (cos qo + isin qo). Then 
(1) Z|-Z2—ri-:r- eet» 
о) ®='1.%-е›, 
22 r2 ' 


(3) z^ = ү". e"? and 


(4) хы = 2/20 = Уту. еї Pot2ka)/n, k=0,1,2,...,n—1. 


Example 1.62 Let 


z e" and z =2.е7/%, 
i.e. 

n=1, m=2, ф= =, and Ф = 2. 
Then 


Zp zy = 2+ eNA) — 2. 3їл/%, 


Using the exponential form of a complex number, we can easily determine the logarithm of 
a complex number. Let 


z=r-e? =r. еЧе+?Кт), kez. 
Taking the logarithm we obtain 
Inz = In L . dip) 
—]Ànr-i(p-2kn) kez. 


From the above formula we see that the imaginary part of Inz is not uniquely determined. 
For simplicity, one takes the imaginary part using k = 0. 


Example 1.63 Let = —12, i.e. z = 12 - e" , and we determine the logarithm: 
Inz = In 12 + i - z ~ 2.4849 + wi. 


Notice again that in the above representation, the argument ¢ is given in radians. 


From Example 1.63 we see that we can compute the logarithm of a negative number within 
the set of complex numbers. 
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EXERCISES 


1.1 


Which of the following terms are propositions? If they are propositions, what is their 
truth value? 


(a) 2+1= 15; (b) Please hold the line. 
(с) [ras (d) 5 < 10; 
(e) 3 is an even number. 


Given are the following propositions 


A Peter meets Ann. 
B Peter meets Betty. 
C Peter meets Cindy. 


Express the following propositions by use of conjunction, disjunction and (or) negation 
of the given propositions: 


Peter meets all three girls. 

Peter meets only Ann. 

Peter meets at least one of the three girls. 
Peter meets exactly one of the three girls. 
Peter meets no girl. 

Peter does not meet all three girls. 


~man 


Verify by means of truth tables that 
(а) AAB&>AVB; (b (45 B) < (В > A) 


are tautologies. 
Find out in which cases A is sufficient for B or A is necessary for B or both. 


(a) A:x eN is even; (0) A:x/3EN; 

B:x/2 €N; B : x is integer; 
(с) A:x <4; (d A:x? = 16; 

B:x* < 16; В: (х= 4) V (х = —4y 
(е) А: (х> 0) л (у> 0); 

В:ху > 0. 


Check whether the following propositions аге true for x є К: 


(a) Ax? — 5x + 10 > 0; 
(b) A,x?—- 2х > 0. 


Find the negations of the propositions and give their truth values. 


Prove indirectly 
1 
x+->2 
x 


with the premise x > 0. Use both methods, proof by contradiction and proof of 
contrapositive. 
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1.7 


1.8 


Prove by induction: 
n 
(а) Y:Qi- nam 
i=1 


n 

(b+) У? = 
i=1 

(c) Leta > 0 and n = 1,2,.... Prove the inequality (1 + a)" > 1 + па. 


, 


Бе 


Which of the propositions 
les; 0 є 5; 2&8; -1¢8S 


are true and which are false in each of the following cases? 


(a) S = {1,2}; (b S={x|x?+2x—3=0}; 
(с) S = {0,1,2}U {—1, 0}; (d S = {0,1,2}M (-1,0). 


Let A = {1,3,5,7,9, 11} and B = {1,2,3,7, 8,9}. Find AUB,ANB, |A], |B|, |4 U В|, 
|A N B, AN Band |А V B]. 

Given is A = {1,2}. Enumerate all the subsets of the set A. What is the cardinality of 
set P(P(A))? 

Illustrate by means of Venn diagrams: 

(а) AU (ВПС) = (AUB)n (AUC); 

(b) (AUB)\ (ANB) = (4\ В) U (BV A). 

Prove that (4 О B) V (AN B) = (AN В) U (BV A). 

8 students of a group study mathematics, 13 students of the group do not live in the 


student hostel, and 17 students study mathematics or do not live in the student hostel. 
How many students of the group studying mathematics do live in the student hostel? 


Of 1,100 students studying economics at a university, 550 students have a car and 
400 students have a PC, and 260 students have neither a car nor a PC. Determine the 
number of students having a car or a PC, the number of students having both a car 
and a PC, the number of students having a car but no PC and the number of students 
having a PC but no car. 

Given are A = {1,2}, B = {2,3} апа С = {0}. Find A x А, Ax B, Bx A,A x C, 
A x C x Band A x A x B. 


Let the sets 
M,={xeR|1<x<4}, Mo={yeR|-2 <y <3}, 
Мз = {ze К |0 <2 <5} 


be given. Find Mı x M» х M5. Illustrate this Cartesian product, which corresponds 
to a set of points in the three-dimensional space, in a rectangular coordinate system. 
There are 12 books on a shelf. How many possibilities for arranging these books exist? 
How does this number change if three books have to stand side by side? 

A designer wants to arrange 3 blue and 5 red buttons in a row. How many possibilities 
are there for arranging them? 


1.22 


1.23 


1.24 


1.25 


1.26 


1.27 
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Given the set ofnumbers {1, 2,3, 4, 5}, how many three-figure numbers can you select? 
How does the answer change when all the figures have to be different? 


A group of n people all say ‘hello’ to each other once. How many ‘hellos’ can 
somebody from the group hear? 


In the vicinity of a holiday resort 15 walks are to be marked with two coloured parallel 
lines. What is the minimum number of colours required to mark all footpaths if both 
lines can have the same colour and the sequence of colours cannot be inverted? 


A set of weights consists of weights of 1, 2, 5, 10, 50, 100 and 500 units. 


(a) How many combinations of these weights are possible? (Use combinatorics and 
consider the cases when first no weight is selected, then exactly one weight is 
selected, and so on.) 

(b) Using your knowledge about sets, explain that for a set of n weights there are 
exactly 2” combinations of weights. 

(c) Prove by induction: 


2 


Simplify the following fractions: 


(a) 4x + 5y 3x — 2y s 15у? . 
x? + 2ху 4y? + 2xy 3x?y + 6xy? , 
a b 


Solve the following inequalities: 


3 
(a) 5x+3 > —2x — 4; (b) < 2; 
2-1 2 v 
х^ – х 
< ©; шш 1. 
© xt1 7 x’ (d) Hua PEE 


Solve the following inequalities: 


(а) |x| «2; (b |x-3| «2; (с) |2x —3| < 2; 
— 1 
( k-4 «bed; ( En 


Simplify the following terms: 
4-542 11/2 24b ^? 8b c^ 


3 b : | 
ч) 2-3/2 e O — 23b 
Solve the following equations for x: 
(a) 4/3x—-9— J2x—5- 1; (b 92-1 — 36.3343 = 0; 


(с) 24х+2 +3 = 4/15 + 3x. 
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1.28 


Simplify the following terms: 


log, N 
(а) а= —logs (log; VV3); — (b) x = yoy — log, b. 


Solve the following equations for x: 
(а) lini +3in x -31n Jx = 2 (In2 + In3); 


(b) y2- 023-1 x? = 1 £ 0; 


(c) log; x + log хх — logi x = 6. 
Find the roots of the following equations and illustrate them in an Argand diagram: 


(а) x?—3x49-20; (b) x*-- 13x? + 36 = 0. 


Illustrate the set of all complex numbers 2 with |2 — i| < 4 in an Argand diagram. 


Find the sum, difference, product and quotient of the following complex numbers: 
zı = l + 4i; Z2 = —2 + i. 
Find the Cartesian form of the following complex numbers: 


(a) z= 1; 
2 

6) z= 4); 

(с) z= re? with r = 24/3 and o = —277/3. 


Find the polar and exponential forms of the following complex numbers: 
1. _(1+г\. 
i? (b) 2 = (1 = i) , 
(c) 2232333: (d z— 


2—i 
ETE 
Given the complex numbers zı = 2 — 2i and z2 = 1 + i, find zi and z1 /z2 by using the 
polar form. Compare the results with those you get by means of the Cartesian form of 
the complex numbers. 


Show that i! is a real number. 


Find the solutions of the following equations: 


(a) z4 = —8 + 8/37; b) 2+ фі = 15. 


d 
Find the real numbers a, and a so that z = 4(cos 40° + i sin 40?) is a solution of the 
equation 
; m 3a) _ 9 
5i+2(1 — D? 


2 Sequences; series; finance 


This chapter deals first with sequences and series. Sequences play a role e.g. when solving 
difference equations. Special types of sequences and series are closely related to problems 
in finance, which we discuss at the end of this chapter. 


2.1 SEQUENCES 


2.1. Basic definitions 


Definition 2.1 If to each positive integer n є М, there is assigned a real number ay, 
then 


{an} = 41,42,а3,...,аһ,... 


is called a sequence. 


The elements a1, a2, a3,... are called the terms of the sequence. In particular a, is denoted 
as the nth term. There are two ways of defining a sequence: 


(1) explicitly by giving the nth term of the sequence; 
(2) recursively by giving the first term(s) of the sequence and a recursive formula for 
calculating the remaining terms. 


This is illustrated by the following example. 


Example 2.1 (а) Consider the sequence {a,} with 


n—1 
= ——_, EN. 
ойык 
We get the terms 
1 2 3 4 
а= 0, a= аз = a4 = а5 = 


In this case, sequence {an} is given explicitly. 
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(b) Let sequence {b,} be given by 
bi =2 and bn+1 = 2b, — 1, neN. 


Then we get the terms 


The latter sequence is given recursively. 
(c) Let sequence {cn} be given by 


Cn = (—1)", n € М. 
In this case, we get the terms 
q=-l, &=1, cec3—-1, c4=1, cs =-l, 


This is an alternating sequence, where any two successive terms have a different sign. 


Arithmetic and geometric sequences 


We continue with two special types of sequences. 


Definition 2.2 A sequence {an}, where the difference of any two successive terms is 
constant, is called an arithmetic sequence, i.e. 


ап+1 = ап — d гап є N, where d is constant. 


Thus, the terms of an arithmetic sequence with the first term ај follow the pattern 
а, at+d, а +24, а +34, ..., aj (n— Dd, 
and we obtain the following explicit formula for the nth term: 


Qn — а + (n — Dd for all n € №. 


Example 2.2 А car company produces 750 cars of a certain type in the first month of its 
production and then in each month the production is increased by 20 cars in comparison with 
the preceding month. What is the production in the twelfth month? 

This number is obtained as the term a12 of an arithmetic sequence with aj = 750 and d = 20, 
and we get 


ay2 =a, + 11 -d = 750 + 11-20 = 970, 


i.e. in the twelfth month, 970 cars of this type have been produced. 
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Definition 2.3 А sequence {an}, where the ratio of any two successive terms is the 
same number q = 0, is called a geometric sequence, i.e. 


Qn+1 
an 


= 9 for all n є №, where q is constant. 


Thus, the terms of a geometric sequence with the first term a; follow the pattern 
a1, а1 `9, aq, a-g, ttt а1 `4 > 
and we obtain the following explicit formula for the nth term: 


ап = а sgid for all n € N. 


Example 2.3 Consider sequence {an} with 
аһ = (-1)"-4", пєй. 
The first term is 


1 
a; = (1)! 47 e 


Using 
asi = (VE AOD and а = (- D" - 4^, 
we obtain 
ап+1 = (-1) .47n-ltn m —41 — ES 
an 4 
Thus, {an} is a geometric sequence with common ratio q = —1/4. 
q 


Example 2.4 А firm produced 20,000 DVD players in its first year 2001. If the production 
increases every year by 10 per cent, what will be the production in 2009? 
The answer is obtained by the ninth term of a sequence {an} with q = 1 + 0.1: 


ay = ај: qÈ = 20,000. 1.18 ~ 42, 872. 


Next, we investigate what will be the first year so that under the above assumptions, 
production will exceed 55,000 DVD players. From 


a, = 55,000 = 20,000- 1.1"! = a, · 4"! 
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we obtain 
5,000 
11712 230 = 2.75 
20,000 
and 
In 2.75 1.0116 
п—1= x де 10.61, 


© Inll 0.0953 


i.e. n © 11.61. Thus, 2012 will be the first year with a production of more than 55,000 DVD 
players. 


Next, we introduce some basic notions. 


Definition 2.4 А sequence {a,} is called increasing (resp. strictly increasing) if 


аһ < ап+1 (resp. an < а) for all n € N. 


Sequence {an} is called decreasing (resp. strictly decreasing) if 


аһ > ап+1 (resp. an > аһ) foralln € N. 


A sequence {a,} which is (strictly) increasing or decreasing is also denoted as (strictly) 
monotone. When checking a sequence {an} for monotonicity, we investigate the difference 
D, = ані — an of two successive terms. If D, > 0 (D, > 0) for all п є N, sequence 
{an} is increasing (strictly increasing). If D, < 0 (D, < 0) for all n є N, sequence {an} is 
decreasing (strictly decreasing). 


Example 2.5 We investigate sequence {an} with 
a, = 2(п — 1? —n, neN, 
for monotonicity, i.e. we investigate the difference of two successive terms and obtain 


dnd = an= 2n SED) = [26 - 1)? -n| 


= 97 cope (2 4n 2—n) 
= 4n — 3. 


Since 4n — 3 > 0 for all n € №, we get а„+1 > an for all n € N. Therefore, sequence {an} is 
strictly increasing. 
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Definition 2.5 А sequence {an} is called bounded if there exists a finite constant C 
(denoted as bound) such that 


|а| x € for all n є №. 


Example 2.6 We consider the sequence {an} with 


(095-2 
= 2 , 


N, 


ün 
n 


and investigate whether it is bounded. We can estimate |a, | as follows: 


(—1)”.2 


2 


2 
ЕЕЕ for alln є №. 
п п 


|an| = 


Therefore, sequence {an} is bounded. Note that sequence {an} is not monotone, since the 
signs of the terms alternate due to factor (—1)”. 


2.1.2 Limit of a sequence 


Next, we introduce the basic notion of a limit of a sequence. 


Definition 2.6 A finite number a is called the limit of a sequence {an} if, for any 
given £ > 0, there exists an index n(€) such that 


lan —a| < € for all n > n(e). 


To indicate that number a is the limit of sequence {a,}, we write 


lim a, = a. 
n—oo 


The notion ofthe limit a is illustrated in Figure 2.1. Sequence {an} has the limit a, if there exists 
some index n (s) (depending on £) such that the absolute value of the difference between the 
term a; and the limit a becomes smaller than the given value = for all terms a, with n > n(e), 
i.e. from some n on, the terms of the sequence {an} are very close to the limit a. If = becomes 
smaller, the corresponding value n(e) becomes larger. To illustrate the latter definition, we 
consider the following example. 
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Qn, * 


Figure 2.1 The limit a of a sequence {ay}. 


Example 2.7 Let sequence {an} be given with 


а= 1+ E neN, 
i.e. we get 
5 10 17 
a, = 2, a2 = 2, аз = y> 44 = ig 


The terms of sequence {an} tend to number one. For example, if = = 1/100, we get 


| 1| 1+ l 1 l : 
an — = = < 
А n2 n? 100° 


which is satisfied for n > n(e) = 11, i.e. all terms a, with n > 11 have an absolute difference 
from the limit a = 1 smaller than є = 1/100. 
If we choose a smaller value of є, say = = 1/10, 000, we obtain for n > n(s) = 101 only that 
1 1 
< $ 
n? 10,000 


la, — 1] = 


i.e. all terms a, with n > n(s) = 101 have an absolute difference smaller than = = 1/10, 000 
from the limit a — 1 of the sequence. Note, however, that independently of the choice of e, 
there are infinitely many terms of the sequence in the interval (a — &,a 4- в). 
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The definition of a limit is not appropriate for determining its value. It can be used to confirm 
that some (guessed) value a is in fact the limit of a certain sequence. 


Example 2.8 Consider the sequence {an} with 


n—2 


an = —— n cN. 
2n 4-3 
We get 
1—2 1 2—2 3-2 1 
aj = = ——, а = = U, Q3 = =-, 
2-143 5 2.2403 2.343 9 
For larger values of n, we obtain e.g. 
zs cts = 0.466 = = 0.483 = = 0.498 
450 = 103 40660, aio = zoz X 0433, а„оо = ооз 


and we guess that a = 1/2 is the limit of sequence {an}. To prove it, we apply Definition 2.6 
and obtain 


| ри ge-2- 0n 3| [2n7 4722-3 
Өл ON an ps: онаа | 4n 4-6 
—7 7 
= = < 6 
4n + 6 4n + 6 


By taking the reciprocal values of the latter inequality, we get 


4n 4- 6 1 
> — 


7 € 


which can be rewritten as 


7 — 6e 
4e ' 


n> 


i.e. in dependence on ғ we get a number n = n(e) such that all terms а, with n > n(e) deviate 
from a by no more than ғ. For e = 107? = 0.01, we get 


— 6.0.01 
n> жоо. х = 173.5, 
4 . 0.01 


ie. we have n(e) = 174 and from the 174th term on, all terms of the sequence deviate by 
less than = = 0.01 from the limit a = 1/2. If e becomes smaller, the number п(є) increases, 
e.g. fore = 107^ = 0.0001, we get 
7 — 6 - 0.0001 
4 - 0.0001 


i.e. п(=) = 17,499. 


~ 17,498.5, 
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Ifa sequence has a limit, then it is said to converge. Now we are able to give a first convergence 
criterion for an arbitrary sequence. 


THEOREM 2.1 A bounded and monotone (i.e. increasing or decreasing) sequence {an} 
converges. 


We illustrate the use of Theorem 2.1 by the following example. 


Example 2.9 Consider the recursively defined sequence {an} with 
ај = 1 апа ап+1 = V3 ag, neN. 


First, we prove that this sequence is bounded. All terms are certainly positive, and it is 
sufficient to prove by induction that a, < 3 for all n є N. In the initial step, we find that 
ај = 1 < 3. Assuming a, < 3 in the inductive step, we obtain 


ап+1 = V3 -an < У3:3 = 3 
for all n € N. Therefore, implication 
аһ < 3 => ап+1 < 3 
is true. Hence, sequence {an} is bounded by three. 


To show that sequence {an} is increasing, we may investigate the quotient of two successive 
terms (notice that a, > 0 for n € N) and find 


anyi М3 аһ _ 3 x 
аһ an an à 


The latter inequality is obtained due to a, < 3. Therefore, арф /ар > 1 and thus ал+1 > an, 
i.e. sequence {an} is strictly increasing. Since sequence {an} is bounded by three, we have 
found by Theorem 2.1 that sequence {an} converges. 


Now, we give a few rules for working with limits of sequences. 


THEOREM 2.2 Assume that the limits 


lim a, =a and lim b, = b 
n— oo n— oo 


exist. Then the following limits exist, and we obtain: 


(1) lim (a, Ec) = lim (a,)+c=a+tc  forconstant c є R; 
n— oo n—oo 


(2) lim (c-a,)=c-a for constant c € R; 
п со 


(3) lim (a, + bn) =a+b; 
n— oo 
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(4) lim (а, - b) =a-b; 
n—oo 
. an a 
5) I — —— b x 0). 
(5) lim Du EE (b #0) 


We illustrate the use of the above rules by the following example. 


Example 2.10 Let the sequences (a,) and {b,} with 


212 -4n —1 Зи2 —1 zd 
a — 0 VERD AE d TOR WALLS n , 
1 5n2 + 10 P m-2n 


be given. Then 


P i-i) 


| . 2$) + 4п – 1 п 
lim a, = lim OSSGqG 7 im ———— ———— 
n—oo noo Sni + n> y2 (5+ 19) 
lim 2+ li : li ! 
lo Heri. mium. Qu. 3 
= 10 B 585 
lim 5+ lim —; mee? 39 
n— oo п 00 n 
Similarly, we get 
Зи2 – 1 п? (3 - +) 
lim b, = lim z= lim —————-— 
поо поо p? +2n noo n2 (n+ 2) 
n 
lim 3 — li ! 
LX lima 3-0 
= EET = 
iia du. РЕ? 


n—oo поо n 
Consider now sequence {cp} with 
Cn = An + by, пєй. 


Applying the given rules for limits, we get 


2 
lim c, = lim a, + lim b, = - +0 = =. 
n—oo n— 00 поо 5 5 

As ап alternative to the above calculation, we can also first determine the nth term of 


sequence {cy}: 
2n? --4n – 1 E 3582-1 
5(n? + 2) п(п? + 2) 
1. Оп? + 4n? —n)+3n?-1 2m + Pn? -in-i 
Е пЗ + 2n B n? + 2n 


Cn = ag d bp = 
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Determining now 


lim с, 
n— oo 


we can get the same result as before. 


In generalization of the above example, we can get the following result for p,q є N: 


0 forp <q 

ип арт? + ay iP +... + ап + ао _ ар forp—4 
noo bond + bg-int!+...+bin+bo | 5s И 

оо Or p > 4. 


This means that, in order to find the above limit, we have to check only the terms with the 
largest exponent in the numerator and in the denominator. 


Example 2.11 We consider sequence {an} with 


3n+2\7 
= 2, € N. 
i (745) t i 


We finish this section with some frequently used limits. 


Some limits 


1 
(1) lim - = 0; 
noon 
(2) lim а" 20 for |a| « 1; 
n— oo 


(3) lim Ус=1 for constant c € R,c 0; 
n— oo 


(4) lim J/n- 1; 
n— oo 


n 


(5) lim < — 0; 
n—oo n! 
nk 

(6) lim — — 0; 
n—oo n! 
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! 
(7) lim  — 9; 


n—oo и! 
1^ 
(8) lim (1+ 3 =e. 
поо n 


From limits (5) up to (7), one can conclude that the corresponding term in the denominator 
grows faster than the term in the numerator as n tends to infinity. 


2.2 SERIES 


2.2.1 Partial sums 


We start this section with the introduction of the partial sum of some terms of a sequence. 


Definition 2.7 Let {an} be a sequence. Then the sum of the first n terms 
n 
эп = 1+0 +... +а = } ag 
K= 


is called the nth partial sum sp. 


Example 2.12 Consider the sequence {a,} with 


2 
ап =34+(-1)"--, пєй, 
п 


Le. 
=3-1-2=1 =3+1= 4 = 3 ы аа 
а = SS) a2 = Sy, a3 — 3-4 а4 = 2 » 
Then we get the following partial sums: 
7 22 
d ul E E rU E Ka a 
7 7 65 
= ауаз АВЕ ААА duci ушы; 


For special types of sequences (namely, arithmetic and geometric sequences), we can easily 
give the corresponding partial sums as follows. 
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THEOREM 2.3 The nth partial sum of an arithmetic sequence {an} with a, = a1 + (n—1)-d 
is given by 


з» = 7° (а + an) = 5: [2а + (1— Dd]. 


n 


The nth partial sum of a geometric sequence {an} with a, = а. q"! and q Æ 1 is given by 


Example 2.2 (continued) We determine the total car production within the first twelve 
months of production. To this end, we have to determine the twelfth partial sum 512 of an 
arithmetic sequence with ај = 750 and d = 20. We obtain 


812 = > ` a +11- d) = 6 - (1,500 + 11.20) = 10, 320, 


i.e. the total car production within the first year is equal to 10,320. 


Example 2.13 Consider a geometric sequence with a1 = 2 and q = —4/3, i.e. the next 
four terms of the sequence are 


8 32 128 512 
2 = – 5, аз = —, a4 = -57> as = чү: 


According to Theorem 2.3, we obtain the fifth partial sum as follows: 


д, 1-3) E =. 12 362 
sth ni ARD lani ор ( 243 —=2. 73 _ 


leu 1—(—2) 7 243 7 81° 


Example 2.4 (continued) We wish to know what will be the total production of DVD 
players from the beginning of 2001 until the end of 2012. We apply the formula for the 
partial sum of a geometric sequence given in Theorem 2.3 and obtain 


ET 1—1.1!? 20,000 
— 20,000 . = 
1-4 1—11 —0.1 


; (1 11?) ~ 427, 686. 


S12 = 41 · 


The firm will produce about 427,686 DVD players in the years from 2001 until 2012. 


Example 2.14 Two computer experts, Mr Bit and Mrs Byte, started their jobs on 1 January 
2002. During the first year, Mr Bit got a fixed salary of 2,500 EUR every month and his 
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salary rises by 4 per cent every year. Mrs Byte started with an annual salary of 33,000 EUR, 
and her salary rises by 2.5 per cent every year. 

First, we determine the annual salary of Mr Bit and Mrs Byte in 2010. Mr Bit got 
30,000 EUR in 2002, and in order to determine his salary in 2010, we have to find the 
term ag of a geometric sequence with a; = 30,000 and q = 1.04: 


ag = 30,000 - 1.04% = 41, 057.07. 


To find the salary of Mrs Byte, we have to find the term Ро of a sequence with bı = 33,000 
and q = 1.025: 


bo = 33,000 - 1.0258 = 40, 207.30. 


In 2010, the annual salary of Mr Bit will be 41,057.07 EUR and the annual salary of Mrs 
Byte will be 40,207.30 EUR. Next, we wish to know which of them will earn more money 
over the years from 2002 until the end of 2010. To this end, we have to determine the partial 
sums of the first nine terms of both sequences. For sequence {an}, we obtain 


ee ы appe ыш огада вв 
uc quoq rcm Ге 
For sequence {bn}, we obtain 
1—4? 1 — 1.025? 
$92 Бу. 4 33,000. —— ———. ~ 328,499.12. 
1—4 1 — 1.025 


Hence, Mrs Byte will earn about 11,000 EUR more than Mr Bit over the nine years. 


2.2.2 Series and convergence of series 


Definition 2.8 The sequence {sn} of partial sums of a sequence {an} is called an 
(infinite) series. 
A series {sn} is said to converge if the sequence {sn} has a limit s. This value 


n оо 
s = lim s, = lim aa: = У`а 
п оо п оо 
k=1 [il 


is called the sum of the (infinite) series {sn}. 
If {sn} does not have a limit, the series is said to diverge. 


Example 2.15 We investigate whether the series {s,} with 


m= neN, 
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converges. To this end, we apply Theorem 2.1 to the sequence of partial sums {s,}. First, we 
note that this sequence {sn} is strictly increasing, since every term a, = 2/K? is positive for 
all k є № and therefore, 5, = Sn + an > Sy. It remains to prove that sequence {s,} of the 
partial sums is also bounded. For k > 2, we obtain the following estimate of term ак: 


2 2 2 sf a 1 
ar = — = . 
eR к kk-ln k-1 k 


Then we obtain the following estimate for the nth partial sum s,: 


k=2 
=2+2 І Е + l І + І : + 
B 2-1 2 3-1 3 4-1 4 


The last equality holds since all terms within the brackets are mutually cancelled except the 
first and last terms. Thus, the strictly increasing sequence {sn} of partial sums is also bounded. 
From Theorem 2.1, series {s,} with 


n 
2 
Sg = у Bà 
k=1 


converges. 


THEOREM 2.4 (Necessary convergence criterion) Let the series {s,} with 


n 
Sy = Уу ак, neN, 
k=1 


be convergent. Then 


lim an = 0. 
n— oo 


The condition of Theorem 2.4 is not sufficient. For instance, let us consider the so-called 
harmonic series {s,} with 


n 
1 
Sn = у, p ne N. 
k=1 
Next, we show that this series does not converge although 


lim a, = lim - = 0. 
n— oo noon 
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Using the fact that for n > 4,n € N, there exists a number k € N such that 25tl < оп < 2Ё%?, 
we obtain the following estimate for the nth partial sum s,: 


MET 1 1 

2°\3 "4 5 8 2k +1 2k n 
S кй ew UR c usq 4 au E 

2 4 8 2k 25.2 2 


k + 1 summands 
Consequently, we get 


2 k+l k+3 
Sn > = + = 


2 2 2 


Since for any (arbitrarily large) k є N, there exists a number и є N such that s, can become 
arbitrarily large, the partial sum s, is not bounded. Thus, the sequence of the partial sums 
{s,} does not converge. 


Definition 2.9 Let {an} be a geometric sequence. The series {sn} with 


is called a geometric series. 


For a geometric series, we can easily decide whether it converges or not by means of the 
following theorem. 


THEOREM 2.5 A geometric series {s,} with а 4 0 converges for |g| < 1, and we obtain 


п п 
: pi ; 1—4 
m s, = lim a1°q = lim а: = 
ша» n—oo 1 
ESI 


li ; 
n— oo SH 1—4 


For |q| > 1, series {s,} diverges. 


Example 2.16 Consider the series {s,,} with 
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This is a geometric series with aj = 2 and q = —1/2. Applying Theorem 2.5 we find that 
this series converges, since |4| = 1/2 < 1, and we obtain 


lim 5, = = = 


n— oo 1—4 1-(-3) 


A first general criterion for checking convergence of a series has already been discussed 
in connection with sequences (see Theorem 2.1). If the sequence of the partial sums is 
bounded and monotone, the series {s,} certainly converges. We consider a similar example 
to Example 2.15, which however uses Theorem 2.5 for estimating the terms $. 


Example 2.17 We consider the series {s,} with 


= 0 
==) тш, n € М. 


Determining the first partial sums, we obtain 


2 2 2 
spen ea a = 2+2=4, 53 = 50+ „ =4+1=5, 


0! 2! 
аа Д6 
Ыш Шш o 3! 
Since 
: 0 Жакем 
a= ———— > ora А 
k= Е DI 


sequence {sn} of the partial sums is strictly increasing. To check whether sequence {s,} is 
bounded, we first obtain for k > 3: 


_ oe 2 " 2 | 01 
SET EDO haan) let ba coe 
—— 
k—2 factors 


Denoting a; — 1 /2*-3_ k > 3, the following estimate holds for n > 3: 


Sn Saj +a +a +... H+% 
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i.e. each partial sum s, with n > 3 is no greater than six (notice that s; = 2 and 52 = 4). In 
the above estimate of sn, we have used the sum of a geometric series given in Theorem 2.5. 
Thus, sequence {s,} is strictly increasing and bounded. From Theorem 2.1, the series {sn} 
with 


converges. 


Definition 2.10 А series {s,} with 


is called alternating if the signs of the terms a; alternate. 


We can give the following convergence criterion for an alternating series. 


THEOREM 2.6 (Leibniz’s criterion) Let {s,,} with 
n 
Sn = у, ак 
К=1 
be an alternating series. If the sequence {|а|} of the absolute values is decreasing and 


lim a, = 0, 
k—oo 


series {sn} converges. 


Example 2.18 We consider the series (55) with 


n n k 
T: pam. n € М. 


This is an alternating series with 


3 
= == 3, = + = 3 - 7 = 5, 
$1 a1 $2 = S1 T a2 2 2 


Е 


54 = $3 + 24 = 


"n 


5 7 
2 4 
First, due to 

3 2. 3k-3(k+1) 3 
к+1 ЕК Ra OR 


lag41| — |ак| = 0, 
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sequence {|az|} is strictly decreasing. Furthermore, 


. (1! 3 
lim ————— —— = 0, 
коо 


and thus by Theorem 2.6, series (s,) converges. 


We now present two convergence criteria which can be applied to arbitrary series. 


THEOREM 2.7 (quotient criterion) Let {s,,} be a series with 


ak 


series {sn} converges. 
(2) IfQ > 1, series {s,} diverges. 
(3) For Q = 1, a decision is not possible by means of the quotient criterion. 


Example 2.19 We investigate whether the series {s,} with 
n 
k 
Sn = z qk’ neN, 
k=1 


converges. Applying the quotient criterion, we get with a, = k/4* 


ак+1| _ (Ё+1)-45 k+1 
а | 4.k — 4k’ 
Therefore, 
k+1 1 
О = lim КЕЕ s im —=-<l. 
Коо | ak k—oo 4k 4 


Thus, the given series converges. 


Example 2.20 Consider the series {s,,} with 


3k? 
Sn — 4k ^ 
k=1 2 


ne М. 
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Using the quotient criterion, we get 


akpi] З(Е+ 13-2 1 P +302 43k 41 
a, | 2.38 2 ГЕ 
апа 
1 
Q= lim | eee 
Коо | ак 2 


Thus, е series considered converges. 


THEOREM 2.8 (root criterion) Let (s,) be a series with 


n 
SQ = y ак. 
k=1 
Then: 
(1) If 
R= lim |а| < 1, 
k—oo 
series {sn} converges. 


(2) ТЁК > 1, series {s,} diverges. 
(3) For R = 1, a decision is not possible by means of the root criterion. 


Example 2.21 Let the series {sn} with 


be given. In order to investigate it for convergence or divergence, we apply the root criterion 
and obtain 


| k 1 
k/ k k 
[ax = ак = 3k = 2 . Wk. 


Further, 
R= lim ар = lim ае E 
Bcc pU rc NI E oc UE AZ. 
The latter result is obtained using the given limit (see limit (4) at the end of Chapter 2.1.2) 
lim Vk — 1. 
k— oo 


Hence, the series considered converges due to R < 1. 
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Example 2.22 Consider the series {s,,} with 


п VE ly 
«-XCT) л n e М. 


To check {s,} for convergence, we apply the root criterion and obtain 
2 
k+1\f 
R= lim Так = lim J| (=) 27k 
k—oo k— oo k 


li (рь. Nf _1 li Mb | 1 
= lim . Él. mJ om : 
pe k 2 пе k 3 AE 


The latter equality holds due to limit (8) given at the end of Chapter 2.1.2. Hence, since 
R > 1, series {sn} diverges. 


23 FINANCE 


In this section, we discuss some problems arising in finance which are an application of 
sequences and series in economics. Mainly we use geometric and arithmetic sequences and 
partial sums of such sequences. In the first two subsections, we investigate how one payment 
develops over time and then how several periodic payments develop over time. Then we 
consider some applications of these considerations such as loan repayments, evaluation of 
investment projects and depreciation of machinery. 


2.31 Simple interest and compound interest 


We start with some notions. Let P denote the principal, i.e. it is the total amount of money 
borrowed (e.g. by an individual from a bank in the form of a loan) or invested (e.g. by an 
individual at a bank in the form of a savings account). Interest can be interpreted as money 
paid for the use of money. The rate of interest is the amount charged for the use of the 
principal for a given length of time, usually on a yearly (or per annum, abbreviated p.a.) 
basis, given either as a percentage (p per cent) or as a decimal i, i.e. 


; Р a 
prt 


Hc 


If we use the notation i for the rate of interest, then we always consider a decimal value in 
the rest of this chapter, and we assume that i > 0. 

In the following, we consider two variants of interest payments. Simple interest is interest 
computed on the principal for the entire period it is borrowed (or invested). In the case of 
simple interest, it is assumed that this interest is not reinvested together with the original 
capital. If a principal P is invested at a simple interest rate of i per annum (where i is a 
decimal) for a period of п years, the interest charge І, is given by 


1, = Р.і:п. 
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The amount A owed at the end of n years is the sum of the principal borrowed and the interest 
charge: 


А=Р+Һ=Р+Р.ї-п=Р.(1+1ї-п). 


Simple interest is usually applied when interest is paid over a period of less than one year. 


If a lender deducts the interest from the amount of the loan at the time the loan is made, the 
loan is said to be discounted. In this case, the borrower receives the amount P (also denoted 
as proceeds) given by 


P=A4A-—A-i-n=4A-(l—i-n), 


where A - i - n is the discount and А is the amount to be repaid at the end of the period of 
time (also known as the maturity value). We emphasize that in the case of a discounted loan, 
interest has to be paid on the total amount A (not on the proceeds P) at the beginning of the 
period. 


Next, we assume that at the end of each year, the interest which is due at this time is added 
to the principal so that the interest computed for the next year is based on this new amount 
(of old principal plus interest). This is known as compound interest. Let Ag be the amount 
accrued on the principal at the end of year k. Then we obtain the following: 


A, =P+P-i=P-(1+i), 
Аз =A, +A isd ASPA, 
Аз = А + 45:i— Ao: (1+0 = Р: (04 Dy), 


and so on. The amount A, accrued on a principal P after n years at a rate of interest i per 
annum is given by 


An — P. P = Р: д. (2.1) 


The term q = 1 + i is also called the accumulation factor. Note that the amounts Ag, k = 
1,2,..., are the terms of a geometric sequence with the common ratio q = 1 + i. It is worth 
noting that in the case of setting 4o = Р, it is a geometric sequence starting with k = 0 
in contrast to our considerations in Chapter 2.1, where sequences began always with the 
first term а. (However, when beginning with Ао, one has to take into account that 4; is 
already the (n + 1)th term of the sequence.) The above formula (2.1) can be used to solve 
the following four basic types of problems: 


(1) Given the values P, i and n, we can immediately determine the final amount A, after n 
years using formula (2.1). 

(2) Given the final amount A,,, the interest rate i and the number of years n, we determine 
the principal P required now such that it turns after n years into the amount А, when the 
rate of interest is i. From formula (2.1) we obtain 


TM An 
+0" 
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(3) Given the values n, P and An, we determine the accumulation factor q and the rate of 
interest i, respectively, which turns principal P after n years into the final amount А,. 
From formula (2.1) we obtain 


4, " 
Saati". (2.2) 


Solving for i, we get 


nf An 
OM ues 
p.c 


[A 
Pa А 
P 


(4) Given the values A,, P and i, we determine the time n necessary to turn principal P at a 
given rate of interest i into a final amount А„. Solving formula (2.2) for n, we obtain 
An 
In — —n-In(14 i 
р =" (1+0 


Е In 4, — ҺР 
In(1 +i) 


We give a few examples to illustrate the above basic problems. 


Example 2.23 What principal is required now so that after 6 years at a rate of interest of 5 
per cent p.a. the final amount is 20,000 EUR? 
This is a problem of type (2) with n = 6, i = 0.05 and А = 20,000. We obtain: 


Ac 20,000 


P= = = 14,924.31 EUR 
(+05 (1+0.05)§ 


i.e. the principal required now is equal to 14,924.31 EUR. 


Example 2.24 How long will it take to double a principal of 20,000 EUR when the interest 
rate is 6 per cent p.a.? 
This is a problem of type (4). Using P = 20, 000, A, = 40, 000 and i = 0.06, we obtain 


In 40, 000 — In 20, 000 
n= x 11.9 
In 1.06 


i.e. it takes about 11.9 years to double the principal. 


Often, we do not have an annual period for interest payments, i.e. compounding takes place 
several times per year, e.g. 


(1) semi-annually — there are 2 interest payments per year, namely after every six months; 
(2) quarterly — there are 4 payments per year, namely one after every three months; 
(3) monthly — there are 12 payments, namely one per month; 
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(4) daily — compounding takes place 360 times per year (in this section, we always count 
12 months with 30 days each). 


Note that in all cases, it is assumed that interest payment is made after equal length periods. 
In the case of m interest payments per year, the rate of interest per payment period is equal 
to i/m, where i is as before the per annum rate of interest as a decimal, and the number of 
interest payments within a period of п years is equal to п. m. Denoting by А„ the amount 
at the end of n years with m interest payments per year, formula (2.1) changes into 


i пт 
Anm = Р. ( 2: ~) (2.3) 
m 


Example 2.25 Mrs Rich wishes to invest 15,000 EUR. The bank offers a rate of interest of 
6 per cent p.a. 

We compute the resulting amount of money at the end of four years when compounding takes 
place annually, quarterly and monthly. We use formulas (2.1) and (2.3) with P — 15,000 
EUR and i — 0.06. 

In the case of annual compounding, we get 


A4 = P (1+i)* = 15,000 - (1 + 0.06)* = 18,937.15 EUR. 


Hereafter, we use the equality sign although all amounts are rounded to two decimal places. 
When compounding takes place four times per year, we compute 444 and obtain 


44 
Ад = Р. (: + 5) = 15,000 - 1.015! = 19, 034.78 EUR. 


Finally, when compounding takes place monthly, we compute 4412 and obtain 


4-12 
A412 = Р. ( + 5) = 15,000 - 1.00548 = 19,057.34 EUR. 


If the compounding takes place several times per year or if the interest rate is not constant 
over time, one is often looking for the effective rate of interest іск. This is the rate of interest 
necessary to turn the principal into the final amount when compounding takes place annually. 
In the case of m interest payments per year at a per annum rate i of interest, there is an interest 
payment of i/m per interest payment period, and we obtain from formula (2.3) 


(Hip (: + z) 
m 


i m 
ig = (12) — 1. 
т 
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Similarly, if there are different per annum rates of interest ij,i2,...,i, with an annual 
compounding, we obtain from formula (2.1) 


(1+ eg)" = (+i) A +i)... Ain) 
ie = УОТА) +i). AHi) 1. 


Example 2.26 A bank offers a rate of interest of 6 per cent p.a. compounded quarterly. 
What is the corresponding effective rate of interest іо? 
Since m = 4, the rate of interest per period is 


1 — 0.015, 
4 
and we obtain 


іо = (1.015)* — 1 = 0.06136355 


i.e. the offer from the bank corresponds to an effective rate of interest of 6.14 per cent p.a. 


Finally we briefly discuss a mixture of simple and compound interest. This is used when 
interest is paid not only for complete interest periods. We illustrate this by the following 
example. 


Example 2.27 Assume that Mr Happy gives 10,000 EUR to a bank on 15 November 2001 
and that he would like to have money back on 20 February 2005. The bank offers a constant 
rate of interest of 4 per cent p.a. compounded annually. 

We can apply the formula for compound interest for the years 2002, 2003 and 2004, but 
for the parts of 2001 and 2005, we apply the formula for simple interest. Remembering our 
assumption that every month consists of 30 days, the bank pays interest for 45 days in 2001 
and for 50 days in 2005. 

Therefore, the amount at the end is given by 


A = 10,000 - [1+ 0.04 2 (1 + 0.04)? - | 1 + 0.04 2 
ES '" 360 | "^ 360 


= 11,367.69 EUR . 


In the above calculations, simple interest is computed for the 45 days from 16 November 
to 31 December 2001 (i.e. the principal is multiplied by 1 + 0.04 - 45/360) and for the 
50 days from 1 January to 20 February 2005 (i.e. the amount at the end of 2004 is multiplied 
by 1 + 0.04 - 50/360). For the three intermediate years from 2002 to 2004, we apply the 
compound interest formula, which leads to the factor (1 + 0.04)?. 
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2.3.2 Periodic payments 


In the previous subsection, we considered the computation of the future value of one 
investment when a fixed amount of money is deposited in an account that pays interest 
compounded periodically. In many situations, there are periodic payments (i.e. deposits or 
withdrawals) and the question is which amount is accrued (or left over) after a number of 
payment periods. Such situations occur, for example, in connection with annual life insurance 
premiums, monthly deposits at a bank, loan repayments, sinking funds and so on. 


The notion annuity is used in the following for a sequence of (usually equal) periodic pay- 
ments. Here we consider only the case when payment periods and the periods for interest 
payments coincide. Moreover, the interest is always credited at the end of a payment period. 


First, we consider a so-called ordinary annuity, where the payments (deposits) are made at 
the same time the interest is credited, namely at the end of the period. We mark all values by 
the superscript ‘E’ which stands for ‘end of the period’. 


Annual payments 


Again, we first consider the case of annual payments r. The amount of an annuity is the sum 
of all payments (deposits) made plus all interest accumulated. Next, we compute the amount 
of an annuity AF, i.e. the total amount after n annual payment periods. Denoting by D; the 
amount resulting from the kth deposit at the end of n years, we get 


AE = Di Dj RD —r 04i)" Hr (102+... е 


-e[easxoscaepr]. 


The term in brackets on the right-hand side represents the partial sum s, of a geometric 
sequence with a; = 1 and = 1 + i. Note that the kth deposit accumulates the value Dz, 
where the payments are compounded over n — k years. We use the formula for the partial 
sum of the first n terms of a geometric sequence and obtain 


1+@@+90+ + a+ = 


1- 0+9 1-ü-c' Q-i'-l 
=e ee EE Е і 


Thus, we have shown the correctness of the following theorem. 


THEOREM 2.9 (amount of an annuity) If an amount r is deposited at the end of each year 
at a per annum rate of interest i, the amount АЁ of the annuity after n annual payments is 


АЁ = ғ 


n 


(ED od 


(2.4) 


The present value of an annuity is the amount of an annuity discounted over the whole time 
period, i.e. it is the amount of money needed now so that after depositing this amount for a 
period of n years at a per annum rate of interest of i, the amount of an annuity АЁ results. We 


86 Sequences; series; finance 
get 
EG 1 B 1 (1+i)"-1 
P, ITA T N, E Ss > 
(1+ iy (1+ i)” i 
(14-2)? - [(1 4- D? — 1] 1-(1+07" 
= ү. - =f Е r 


1 1 


In connection with periodic withdrawals the present value PE has an important meaning. It 
is the amount of money needed now so that, if it is invested at a per annum rate of interest i, 
n equal money amounts r can be withdrawn at the end of each year without any money left 
over at the end. 


To illustrate this, let us argue that the sum of the present values of the individual withdrawals 
is indeed equal to РЁ. Assume that the withdrawals are made at the end of each year and let 
PV, be the present value of the kth withdrawal. Then 


PV, =r-——. 
re axo 
Summing up, we get 

PV = PV + РР +---+ РУ, 
з. NETTE АЕ 
r- жа ж r- 

(1+ i)? (1 +i)" 


=r 


"QUE 
= : 1+@+ КТ 
Bod ы М жс ae сы эы 
=r-(1+/) Ic, =r = 
Lip EE MUT ш. 


1 


i.e. the sum of the present values of all the n withdrawals corresponds to РЁ. Therefore, we 
have proved the following theorem. 


THEOREM 2.10 (present value of an annuity) Ifr represents the withdrawal at the end of 
each of n years, the present value РЁ of an annuity at a per annum rate of interest i is 


1— (14 i)" 
r: ———————. 


1 


РЁ = 


n (2.5) 
As a consequence, we do not need to distinguish between periodic deposits and withdrawals 
in the sense that formulas follow the same pattern. When considering periodic deposits, 
we are usually interested in the amount of an annuity АЁ, and when considering periodic 
withdrawals we are interested in the present value of an annuity РЁ. 


Next, we discuss the case when the payments are made at the beginning of the period while 
interest is still credited at the end. We mark the corresponding values by the superscript ‘B’ 
which stands for ‘beginning of the period’. Notice that this is not an ordinary annuity. 
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First, we discuss the modification of the formula for the amount of the annuity AP. 
The difference from the previous case is that for each deposit, interest is credited for one 
more period, i.e. the Ath deposit accumulates interest over n — k + 1 years. Let Dy be the 
amount at the end of n years which results from the kth deposit. Then Dg = r- (1 + jy kd 
Therefore, we get 


AB = р, 1+0, 2+... +D =r (0 +D)+r +D +... +r +i)” 
= @а+о+@а+д+а+0?+...+@+0"?+(@+0"—\] 


= a+) CEP LL 


By the above computations we have found that АВ = (1 + i) - AE which reflects the fact that 
for each deposit, interest is credited for one more year in comparison with deposits at the end 
of each year. 


Correspondingly, the present value PP is obtained by discounting the amount of an annuity 
for a period of n years, i.e. 

1 1-(1+)" 

РВ = AP Sr) — 

бекш he : 
We now discuss some basic types of problems connected with periodic payments (deposits 
or withdrawals). Here we consider only the case when payments are made at the end of each 
year and interest is credited at the same time. In the above formulas we have used the values 
PE, АЁ, r, i and n. Among these values, at least two (or even three) must be given for each 
problem type, and we are looking for one of the remaining values. 


(1) Given the per annum rate of interest i and the number of years n, we are looking for 
the amount of the annuity АЁ. This problem arises when annual deposits are made in 
an account, rate of interest as well as the time period are known in advance, and we 
wish to determine the amount in the account at the end of n years. In this case, we can 
immediately apply formula (2.4). 

(2) Given the values r, i and n, we are looking for the present value of the annuity PP. This 
type of problem arises when periodic withdrawals are made, and we wish to know the 
current amount required to guarantee the n withdrawals such that no money is left over 
at the end when the time period and rate of interest are known. In this case, we can 
immediately apply formula (2.5). 

(3) Given the values АЁ, i and n, we are looking for the annual payment r to get the final 
amount AF provided that the time period of the periodic payments and the rate of interest 
are known. From formula (2.4), we obtain 

i 


PELA ae 
"n dri 


(4) Given the values AE, r and i, we are looking for the time period n which is required to 
accumulate the final amount АЁ by periodic payments of an amount r at a given constant 
rate of interest i. In this case, we obtain 


A. =(1+0"—1 


FEE 
r 
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which can also be written as 
+i)" =14+ 42. *. 
r 


Taking the natural logarithm on both sides, we obtain 
In(1 +i)" = in(1 ny 3 | 
r 


Solving the latter equation for n and using the equality In(1 + i)” = n - In(1 + i), we get 


E In (1+ AE. 2) 


In(14 i) 2:9) 


5) Given the values AE, n and r, we wish to determine the accumulation factor q resp. the 
n q resp 
per annum rate of interest i = q — 1. This is, from a mathematical point of view, a more 


complicated problem. First, from formula (2.4), we obtain 


— 


AE. ir. [a 9 i" — 1]. 
This yields 
AE.(1 Ei - AES r- (14i —r 
r-(- i" 4.(1+0 + (42-1) =0 


AE AE 
a+- aeos (2-1) o 


Now, the problem corresponds to finding the zeroes of a polynomial of degree n in the 
variable q = 1 + i and in general, we cannot solve this equation for q. Therefore, we 
have to apply approximation methods for finding a root of this equation greater than one 
(remember that only values q = 1 + i > 1 are of interest). In Chapter 4, we discuss 
some procedures for finding a zero of a function numerically. However, there is another 
difficulty for big values of n: possibly there is no uniquely determined solution q > 1. 


Similarly to problems of type (3), (4) and (5), we can formulate the corresponding problems, 
where the amount of an annuity АЁ has been replaced by the present value of an annuity РЁ. 
As an example, for the problem of type (3), this gives the equality 

к= РЁ. | 


п Ted (2.7) 


Analogously, one can formulate the corresponding problems for the case when the payments 
are made at the beginning of the years. This is left as an exercise for the reader. We illustrate 
the solution of the basic problems mentioned above by some examples. 


Example 2.28 What is the principal required now in order to be able to withdraw 3,000 
EUR at the end of each of ten years (beginning at the end of the first year) so that no money 
is left over after these ten years provided that a rate of interest of 5 per cent p.a. is offered? 
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This question is the same as asking for the present value of an annuity PE with r = 3,000 
and i = 0.05. We obtain 


1— (1+7! 1 — (1 + 0.05)-10 
ы Бл Ие ENEO E d 6D UR: 


PŘ =r. 
1057 i 0.05 


i.e. the principal required now to ensure an annual payment of 3,000 EUR at the end of each 
year over a period of ten years is 23,165.20 EUR. 


Example 2.29 Mrs Thrifty intends to accumulate an amount of 25,000 EUR until the end 
of a period of 12 years. If she wants to make annual payments at the end of each year 
and the bank offers a rate of interest of 6.25 per cent p.a., what is the required annual 
payment r? 

This is a problem of type (3) with n — 12, 48, = 25,000 and і = 0.0625, and so we 
obtain 


0.0625 


= 25,000. 
(1 + 0.0625)12 — 1 


he AR ; = 1,460.43 EUR, 


i 
(1+ 012—1 


i.e. the annual amount that Mrs Thrifty has to put into an account at the end of each of 12 
years to reach her goal is 1,460.43 EUR. 


Example 2.30 Assume that Kevin has 2,000 EUR left over at the end of each year and that 
he puts this amount into a savings account. How long does Kevin need in order to accumulate 
20,000 EUR in his account provided that the rate of interest is 5.5 per cent p.a.? 

This is a problem of type (4), and so we apply formula (2.6) with r = 2,000 EUR, AE = 
20, 000 EUR and i = 0.055. This yields 


Р 0.055 
BETEVIPE In (1 +20, 000. $955) Mu 
Эа Ыр —— In(1 + 0.055) eee 


i.e. after approximately 8.2 years Kevin has accumulated 20,000 EUR in his account. 


Several payments per year 


Next, we briefly discuss the modifications if there are several payment and interest periods 
per year. Remember that we assume that payment periods are identical to interest periods. 
As before, denote by m the number of payment periods per year, and 4, ,, and P, ,, (either 
with superscript ‘E’ or ‘B’) denote the amount of the annuity resp. the present value of the 
annuity provided that the payments go over n years with m payment periods per year. In this 
case, the amount of the annuity is given by 


Es I 
Арт =: 
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Furthermore, the present value PL т of the annuity is given by 


lei 
je ola EE ; 


The corresponding values for the case when payments are made at the beginning of each 
period are obtained as 


Example 2.31 Claudia wants to put aside 100 EUR at the end of each month for a period 
of seven years. The bank offers a rate of interest of 3.5 per cent p.a. compounded monthly. 
What is the amount in Claudia's account after seven years? 

This is a problem of type (1), and Claudia needs to determine the amount of an annuity AF v 
where payments are made at the end of each period. Using r — 100 EUR and i — 0.035, 
she obtains 


84 
0.035 
wo + es s (+) - 
А7 =” = 100. 7035 = 9,502.83 EUR, 
12 12 


i.e. after seven years, there are 9,502.83 EUR in Claudia's account. 


2.3.3 Loan repayments, redemption tables 


One application of periodic payments is loan repayments. First, we again consider annual 
payments and later we briefly discuss the modifications in the case of several payments 
per year. We use the following notations for year k: 


Dy current amount of the loan (i.e. the debt) at the end of year k,k = 1,2,...,n (more 
precisely, after the payment at the end of year k), moreover let Do be the initial amount 
of the loan; 


Iy interest payment on the loan at the end of the Ath year, k = 1,2,...,n; 
Ry amortization instalment at the end of the kth year (i.e. the repayment on the loan), 


k=1,2,...,n; 
Ay annuity of the kth year, i.e. the amount that is paid at the end of year k, 4; = Ry + Ik, 
К =1,2,...,n. 


A loan with a fixed rate of interest is said to be amortized if both principal (i.e. the amount 
of the loan) and interest are paid by a sequence of payments made over equal time periods. 
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We consider the following two variants of loan repayments: 


(1) repayment by equal annuities (i.e. in each period a constant amount of amortization 
instalment and interest is paid); 

(2) repayment by equal amortization instalments (i.e. the repayments on the loan are identical 
in all years, but different payments are made over the years, since interest payments vary 
and therefore annuities are different). 


Equal annuities 


In this case, we assume that the annuity is the same in each year, i.e. 
Ак = Ik + Rk =A, where А is constant. 


First, we mention that the present value of all payments for paying off a loan corresponds 
to the present value РЁ of periodic payments at the end of each year. Therefore, finding the 
annuity A corresponds to the problem of finding the annual payment if a rate of interest i, 
the duration n of paying off the loan and the present value of an annuity are known, and 
payments are made at the end of each year. Thus, it corresponds to a problem of type (3) in 
the previous subsection, and we can apply formula (2.7) with PE = Do and r = A: 


i 


A= Do: ————— 
1— (14 i)?" 


(2.8) 
The current amount D% of the loan at the end of year k can be obtained as follows. First, 
if no payments would be made, the amount of the loan at the end of k years would be equal 
to D; = Do- (1+ i)* according to formula (2.1). Second, the annual payments at the end 
of each year result after k years in the amount AE of an annuity with payment A at a rate of 
interest i for k years according to formula (2.4). Then the remaining amount of the loan at the 
end of year k (i.e. after the payment in year k) is given by the difference of the two amounts 
Di and АЁ, ie. 


POPS 


Dy = Di — AE = ро. (14+. i — A4 ; 


(2.9) 


In formula (2.9), we can replace the initial amount of the loan Do using formula (2.5) with 
r — A and obtain 


1-(1+07" 
і 


СА) —1 
і 


Di S. пр 
А 


= = fa+o -a+ -a+ +1] 
1 

which gives 

P 1— pere, 


1 


D; = (2.10) 


For k = 0, formula (2.10) has the same structure as formula (2.5) for finding the present value 
of an annuity РЁ. However, the following comment has to be added. When computing the 
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annuity A according to formula (2.8), in practice this value is rounded to two decimal places. 
This leads to the situation that the annuity of the last year is usually a bit larger or smaller 
than the rounded value. Therefore, usually formula (2.9) is applied to the rounded value A, 
and formula (2.10) is only exact when the same annuity is paid in all years. (Alternatively, 
one can use a slightly different annuity in the last period so that D, = 0 is obtained.) 


The interest payment J, at the end of year k can be obtained as follows by taking 
into account that interest has to be paid on the current amount D;_; of the loan at the 
end of year k — 1: 


| weer | 
меран [раа ae - Е 


1 
о тея AS = BEDE) es 


Because J, = A — Rx, we immediately get the repayment R+ on the loan in year k from the 
last formula: 


В = DD A +D. 


From the formula before last we again see that J; = Ро · i. Asa conclusion, the amortization 
instalments {Rz} form a geometric sequence with the first term 


Ry edd 
and the quotient q = 1 + i, i.e. we have 
Ry, = В: 0D. 


Itis worth noting that in the case of paying in each year exactly the same annuity, the formulas 
for determining J; and А; can be simplified as follows: 


I = D-1 -i =4.(1— (1+ prin. 
Ry = А — Ik =A. (1 тул. 
It should be mentioned again that, since all values are rounded to two decimal places, there 


may be slight rounding differences depending on the formula used. 


In the case of m payments per year and m interest payments, we have to replace the per annum 
rate of interest i again by the rate of interest i/m per period and the exponent n (which is 
equal to the number of compoundings) by n · m. This yields an annuity 4’, to be paid in each 
of the m payment periods per year 


i 
Аһ = Do: ———— 
US ү" 


Similarly, the two replacements concerning the interest per period and the number of payment 
periods have to be made in the other formulas. 
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Equal amortization instalments 


In this case, we assume that the repayment on the loan is the same in each year, i.e. 


Ry =R, where R is constant. 


This means that in each year the repayment on the loan is 


n 


R 
Therefore, the remaining amount of the loan at the end of year k is given by 
D k 
Dy = Do —k- — = ро: (1- 3l 
n n 


The amounts {D;} form an arithmetic sequence with difference —Do/n. In period k, interest 
has to be paid on the current amount of the loan at the end of period k, i.e. 


iare ee 
ID = Dg-1 -i = Do- 1— 1. 
п 


Thus, the annuity А; of year Ё is given by 


Do : 
Aj = Rg +h = — + Юрі 
n 


k=i D 
= 04 Dy (1-4) = -neecken-a. 
n n 


In the case of m payment and interest periods per year, we have to replace i by i/m and n 
by n - m. A summary of the payments and the present values of the loan can be given in a 
redemption or amortization table. It contains for each period k: 


(1) the current amount of the loan at the beginning (D;. |) and at the end (D+) of the period; 
(2) the interest payment 1%; 

(3) the amortization instalment А; 

(4) the annuity (sum of interest payment plus amortization instalment), i.e. 4; = J, + Ry. 


We illustrate the two types of loan repayments by the following examples. 


Example 2.32 Tom intends to amortize a loan of 10,000 EUR at a rate of interest of 
7 per cent p.a. in six years. He decides to make annual equal payments at the end of each 
year. Using По = 10,000 EUR, i = 0.07 and n = 6, the annual annuity is obtained as 


i 0.07 


A= Dp —— —— = 10,000. —— — 
1—(1+ =" 1— (14 0.07)-6 


— 2,097.96 EUR. 
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For the first year (i.e. k = 1), we get 


I; = Do -i = 10,000 - 0.07 = 700.00 EUR, 
Rı = А— I; = 2,097.96 — 700.00 = 1,397.96 EUR, 
Dj = Do — Ri = 10,000 — 1,397.96 = 8, 602.04 EUR. 


For the second year (i.e. k = 2), we get 


h = D; - i = 8,602.04 - 0.07 = 602.14 EUR, 
R = A — = 2,097.96 — 602.14 = 1,495.82 EUR, 
р» = Ру — R2 = 8, 602.04 — 1,495.82 = 7, 106.22 EUR. 


Notice that the amount D» can also be directly determined by using formula (2.9) 
(without computing D; first). A complete redemption table is given as Table 2.1. It is worth 
noting that in the last year, the annual annuity is one cent lower than in the other years which 
is caused by rounding the value of A4 to two decimal places. Applying the given formulas, 
we obtain Rg — 1,960.71 EUR but Ds — 1,960.70 EUR. Therefore, one cent less is paid in 
year six. 


Table 2.1 Redemption table for Example 2.32 (equal annuities, annual payments) 


Period Current amount Interest Amortization Annuity Amount 

(vear) of the loan at instalment of the loan 
the beginning at the end 

k Рр n Rk A Dy 

1 10,000.00 700.00 1,397.96 2,097.96 8,602.04 

2 8,602.04 602.14 1,495.82 2,097.96 7,106.22 

3 7,106.22 497.44 1,600.52 2,097.96 5,505.70 

4 5,505.70 385.40 1,712.56 2,097.96 3,793.14 

5 3,793.14 265.52 1,832.44 2,097.96 1,960.70 

6 1,960.70 137.25 1,960.70 2,097.95 0.00 


Example 2.33 The Happy family have raised a mortgage on their house. They can repay 
the loan of 120,000 EUR at a rate of interest of 8 per cent p.a. over a period of 10 years. They 
decide to pay quarterly rates. This means that there are n -m = 10-4 = 40 payment periods. 
The quarterly annuity is obtained as 


1 
А4 = Do = —n-m 
1- (14 4) 
008 0.02 
4 . 
= 120,000. = 120,000. ——L— = 4,386.69 EUR. 
1 — 1.02-40 


—104 
1— (1 + 098) 


Table 2.2 is a redemption table that includes the corresponding values after each year (i.e. the 
corresponding amounts for periods 4, 8, . . . 40) to give an overview on the payments. As an 
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Table 2.2 Redemption table for Example 2.33 (equal annuities, quarterly payments) 


Period Amount of Interest Amortization Аппийу Amount 
(quarter) the loan at instalment of the loan 
the beginning at the end 

4k Без Tk. Rea А4 Dia 

4 113,919.93 2,278.40 — 2,108.29 4,386.69 111,811.64 

8 105,230.38 2,104.61 2,282.08 4,386.69 102,948.30 
12 95,824.54 1,916.49 2,470.20 4,386.69 93,354.34 
16 85,643.34 1,712.87 2,673.82 4,386.69 82,989,52 
20 74,622.89 1,492.46 2,894.23 4,386.69 71,728.66 
24 62,694.00 1,253.88 3,132.81 4,386.69 59,561.19 
28 49,781.78 995.64 3,391.05 4,386.69 46,390.73 
32 35,805.19 716.10 3,670.59 4,386.69 32,134.60 
36 20,676.47 413.53 3,973.16 4,386.69 16,703.31 


40 4,300.66 86.01 4,300.66 4,386.67 0.00 


illustration, we compute some of the values given in Table 2.2. It is worth emphasizing that 
we have to adapt the formulas given for the annual payments. At the end of the second year, 
8 payments have been made. Denoting by the index pair (k, m) the mth period in the kth year, 
we get for period 8: 


j 2-4-1 i i 2-4-1 
ра ={л,—р-5\)-(1+-— 
aeo (1+4) (4 0 i) (1+4) 


= (4,386.69 — 120,000 - 0.02) - 1.027 = 1,986.69 - 1.02’ = 2,282.08 EUR, 
D 4 = Ay — R24 = 4,386.69 — 2, 282.08 = 2, 104.61 EUR, 


„ү 2-4 i\2-4 
1+1) -1 
D24 = Do - (1 + 5) Ay: ( a) 


1 


4 


1.028 — 1 
= 120, 000 - 1.025 — 4, 386.69 - vc m 102,948.30 EUR. 


Similarly, at the end of the fifth year, 20 payments have been made. We get for period 20: 


5.4—1 
Rs4= Ri - (: + =) = 1,986.69 - 1.02! = 2,894.23 EUR, 


Is4 = Ay — R54 = 4,386.69 — 2, 894.23 = 1,492.46 EUR, 


‚у 5:4 i 
1+ 2 
Dsa =D: (19-2) a,‘ A 


.0220 — 1 
= 120,000 - 1.0220 — 4, 386.69 . ai 71, 728.66 EUR. 


Notice that for the computation of R54, amount А has been taken from the previous 
computation of amount А д. Again, the annuity of the last period is a bit smaller than the 
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annuities in the previous periods since the amount of the loan at the end of period 39 is 


„ү 39 i)39 
i 1+3) -1 
Роз = Do · (1+ ) A^: | a) 


4 


1.0222 — 1 
—  ——— = 4,300.66 EUR, 


and the interest payment in period 40 on the remaining loan is Лод = Роз · 1/4 = 
86.01 EUR. 


Example 2.34 The Young family intend to repay a loan of 9,000 EUR over a period of 
six years by equal amortization instalments at the end of each year. The rate of interest on 
the loan is 7.5 per cent p.a. The complete redemption table is given as Table 2.3. We again 
compute only some of the values given in Table 2.3. For the second year, we get 


| Do _ 9,000 


n 


= 1,500 EUR, 
jx. 1 
h = Do- ( E *) -i = 9,000 - (1 > 3) - 0.075 = 562.50 EUR, 
n 
2 2 
Dz = ро(1- 5 ) = 9,000. (1— = ) = 6,000 EUR. 
n 


Reminding ourselves that R4 = R, we get for the fourth year 


Table 2.3 Redemption table for Example 2.34 (equal amortization instalments) 


Period Amount of Interest Amortization Annuity Amount 

(ear) the loan at instalment of the loan 
the beginning at the end 

k Dy Tk Ry Ар Dy 

1 9,000.00 675.00 1,500.00 2,175.00 7,500.00 

2 7,500.00 562.50 1,500.00 2,062.50 6,000.00 

3 6,000.00 450.00 1,500.00 1,950.00 4,500.00 

4 4,500.00 337.50 1,500.00 1,837.50 3,000.00 

5 3,000.00 225.00 1,500.00 1,725.00 1,500.00 

6 1,500.00 112.50 1,500.00 1,612.50 0.00 
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2.3.4 Investment projects 


The considerations about periodic payments are also useful for evaluating a planned invest- 
ment project or for comparing several alternatives. In this subsection, we discuss the following 
two methods: 


(1) method of net present value; 
(2) method of internal rate of return. 


In both methods, the expected costs and returns are considered for a couple of years. For 
further considerations, we introduce the following notations: 


n considered time period (duration) of the project in years; 

Ck (expected) costs connected with the project in year k (k = 0, 1,..., и); 

Ry (expected) returns in year k (k = 0, 1,..., п), usually Ro = 0; 

By = Ry — Ck (expected) balance in year k (k = 0,1,...,5); 

i per annum rate of interest (at which the company can invest money 
otherwise). 


In the following, we always assume that the balances B4 for year k are taken into account at 
the end of the corresponding year. Moreover, we assume that the rate of interest i is constant 
over the years. However, using the comments with variable rates of interest (see part about 
compound interest), one can immediately adapt the given formulas to this case. 


Method of net present value 


The idea of this method is to compare the present values of the expected balances over the 
years. The net present value NPV is defined as 


n 
NPV — Ув, (e dj-* 
k=0 


HEBDO) АБ AHD... +B, AHD”. (2.11) 


In the above formula, Вк. (1 + i) * gives the present value of the balance Ву (i.e. arising at 
the end of year k and discounted to the time of the beginning of the project). Factor (1 + i) ^ 
is the reduction factor applied in year k for finding the present value of the kth balance. 
Finally, the net present value is the sum of the present values of all the expected balances 
within the considered time period of n years. 


If the net present value NPV is positive, the project is advantageous. In the case of a negative 
net present value, the project is not recommendable. If several investment possibilities are 
compared, the variant with the highest net present value has to be chosen, provided that this 
value is greater than zero. 


Example 2.35 А company can open a project which is going over a period of six years. 
The expected costs and returns are given in the second and third columns of Table 2.4. 
Moreover, the rate of interest at which the company can invest money otherwise is equal 
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Table 2.4 Net present value for the investment project in Example 2.35 


Year Costs Returns Balance Reduction factor Present value 
k Ch Ry By a+) of balance Bj 
0 160,000 0  —160,000 1.000000000 — 160,000.00 
1 10,000 45,000 35,000 0.914731192 32,110.09 
2 12,000 60,000 48,000 0.841679993 40,400.64 
3 15,000 67,000 52,000 0.772183480 40,153.54 
4 16,000 74,000 58,000 0.708425211 41,088.66 
5 25,000 55,000 30,000 0.649931386 17,888.02 

NPV: 11,490.02 


to 8 per cent p.a. We apply the method of net present value to evaluate whether the 
intended project is recommendable. In order to determine NPV according to formula (2.11), 
we compute first the balance for each year (column 4) and then the discounted present 
value of each balance (column 6). Moreover, the reduction factor (1 + i) ^ which has to 
be applied for the corresponding year is given (column 5). The sum of the present values 
(NPV = 11,490.02, see last row) is positive. Therefore, the company decides to implement 
the project. 


Method of internal rate of return 


This method assumes that the sum of the present values of the costs over the years is equal 
to the sum of the present values of the returns. In other words, it is assumed that the net 
present value is equal to zero. From this condition the rate of interest i (i.e. the internal rate 
of return) is determined and compared with the expected minimum rentability r. If i > r, 
then the project is advantageous, since it has a higher internal rate of return. In the case of 
i « r, the project is rejected. If several possible investment projects have to be compared, 
the project with the highest internal rate of return is chosen, provided that i > r. 


Example 2.36 A project which is going over two years has the (expected) costs Cj, and 
returns Rz at the end of year k given in Table 2.5. Moreover, the company expects a minimum 
rentability of 7.5 per cent p.a. 
We apply the method of internal rate of return. From NPV — 0, we obtain 

—35,000 + 17,500 - (1 + 07! + 22,000 - (1+ i)? = 0 
which yields after multiplying by —(1 + i)? the equation 

35,000 - (1+ 02 — 17, 500 - (1 + i) — 22,000 = 0 


or equivalently 


1 22 
ü0-i?--.ü4-)0-l:-0 
2 35 
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Table 2.5 Data for the investment 
project in Example 2.36 


Year Costs Returns Balance 


k C, Ry By 
0 35,000 0  —35,000 
1 6,000 23,500 17,500 


2 9,000 31,000 22,000 


This is a quadratic equation in the variable q = 1 + i, and we obtain the following two 
solutions for q: 


1 1 22 
= —+,/— + © x 0.25 + 0.83130706 ~ 1.08131 
N= atv 16+ 35 p ; 


q2 © 0.25 — 0.83130706 ~ —0.58131. 


From qı © 1.08131 we obtain approximately the rate of interest p = 100. (41 — 1) © 8.131 
per cent р.а. The second solution q2 is negative and therefore not of interest, since it leads 
to a negative rate of interest. Because i ~ 0.0813 > r = 0.075, the project appears to be 
worthwhile. 


Finally, we consider an example comparing both methods. 


Example 2.37 An enterprise has the possibility to realize two projects A and В. 
Both projects are going over two years and the expected costs and returns are given in 
columns 2 and 3 of Table 2.6. Alternatively the firm can invest money at a rate of interest of 
8 per cent p.a. 

We apply both methods to decide whether one of the projects can be chosen and if so which 
of the projects is better. Based on the balance B, for the corresponding year k (column 4), 
in column 5 the reduction factors for computing the present values of the balances for each 
of both projects are given (column 6). Summing up the present values of the balances, we 
get a net present value of 1,242.80 EUR for project A and a net present value of —754.46 
EUR for project B. According to the method of net present value, project A is chosen, since 
only project A has a positive net present value. When applying the method of internal rate 
of return, we get for project A from NPV = 0: 


—36,000 - (1 + i? + 18,000 - (1 + i) + 24,000 = 0 
which can be written as 
(1+ 02 | (44 i) E 
D – =. їй) – — = 0. 
2 3 


This quadratic solution has two solutions in the variable q = 1 + i, namely 


Ex. + 3+ 32 0853912563 ~ 1.103912563 
а 48 47 noe 
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Table 2.6 Data for projects A and B given in Example 2.37 


Year Costs Returns Balance Reduction factor Present value 


k Cr Ry By (++) of balance Bj, 
Project A 
0 36,000 0 —36,000 1.000000000 —36,000.00 
1 4,000 22,000 18,000 0.925925925 16,666.67 
2 7,000 31,000 24,000 0.857338820 20,576.13 
МРУ 1,242.80 
Project В 
0 75,000 0 —75,000 1.000000000 —75,000.00 
1 12,000 32,000 20,000 0.925925925 18,518.52 
2 16,000 81,000 65,000 0.857338820 55,727.02 
МРУ —754.46 


апа 
1 
д2 f = 0.853912563 < 0. 


Since the second solution leads to a negative rate of interest, we obtain from the first solution 
the internal rate of return i ~ 0.1039, or equivalently p ~ 10.39 per cent. Correspondingly, 
for project B we get from NPV = 0 


—75, 000 - (1 + i)? + 20,000 - (1 + i) + 65,000 = 0 
which yields the quadratic equation 


4 13 
1+0?——.-(1+0—— =0. 
(1 + i) 15 (1+0 15 


We obtain the two solutions in the variable q = 1 + i 


= jaa м 2. + 0,940449065 ~ 1.073782398 
11 = i5 25 I5 ` ie: 


and 
2 
д2 f 15^ 0.940449065 ~ —0.807115731 < 0. 


The first solution yields the internal rate of return of i ~ 0.0738, or correspondingly 
7.38 per cent. 

Using the method of internal rate of return, we use r = 0.08 as minimum rentability. We have 
found that project A leads to an internal rate of return of 10.39 per cent while project B yields 
an internal rate of return of only 7.38 per cent. By the method of internal rate of return, 
project A is also preferred, since only project A has an internal rate higher than the minimum 
rentability of 8 per cent. So both methods yield the same recommendation concerning the 
possible realization of one of the projects. 
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2.3.5 Depreciation 


The book value of a machine or an industrial facility decreases over the years. Next, we 
discuss two variants of depreciation on machinery, namely linear depreciation and degressive 
depreciation. 


Linear depreciation 


In the case of linear depreciation, the depreciation rate D per year is constant. Denoting by 
Bo the initial book value (i.e. the cost of the machine or facility) and by B, the final value 
after the useful life of the machinery, then the depreciation rate D per year is obtained as 


_ Bo — Bn 


^ 


D 


n 


and we obtain the book value B; after k years as follows: 
By = Вр] — D = Ву – (k — 1) - D. 


The book values {B+} form an arithmetic sequence with difference —D. 


Example 2.38 A facility has been bought for 50,000 EUR. The useful life is six years, and 
the final book value is 8,000 EUR. Therefore, the annual depreciation rate is 
Bi Bn — Bo _ 50,000 — 8,000 
E n > 6 


= 7,000 EUR. 


We obtain the depreciation table given as Table 2.7. 


Table 2.7 Depreciation table for Example 2.38 (linear 
depreciation) 


Year | Bookvalueat Depreciation Book value 


the beginning amount at the end 

k Br} Di By 

1 50,000 7,000 43,000 
2 43,000 7,000 36,000 
3 36,000 7,000 29,000 
4 29,000 7,000 22,000 
5 22,000 7,000 15,000 
6 15,000 7,000 8,000 


Degressive depreciation 


We discuss two types of degressive depreciation, namely arithmetic-degressive depreciation 
and geometric-degressive depreciation. We begin with the arithmetic-degressive depre- 
ciation. This type of depreciation is characterized by a linearly decreasing depreciation 
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amount, i.e. 
Dy = Di, — d. 
Here we consider only the special case of D, = d, i.e. the last depreciation amount is equal 


to the annual reduction in the depreciation amount (digital depreciation). The depreciation 
amounts {2} form an arithmetic sequence with the first term 


D, =n-d 
and the difference —d. The kth depreciation amount in dependence on D, and d is given by 
Dy = Di — (k — 1) :а. 
The book value Ву at the end of year k is given by 
k k 
В; uim = Bo- =: Di + Dy) 


k k 
= Во 5 [n:d+(n-k+1):d]=Bo- 3: @n-k+1):d. 


Next, we deal with geometric-degressive depreciation. This type of depreciation is character- 
ized by a constant depreciation percentage p with i = p/100 in comparison with the current 
book value, i.e. the depreciation amount in year k is given by 


Dy —By 1-і, k=1,2,...,n. 
In this case, we get the book value 
By, = Во: (1— i* (2.12) 


at the end of year k. Thus, the book values {Вк} form a geometric sequence, and the 
depreciation rate in terms of Bo and i for the kth year is given by 


Dy = Be-1-i = Во: (0 - D 5. i. 


Typically, the initial book value Bo (i.e. the cost of the machine or a facility) and the book 
value at the end of the useful life B, are given. Then the depreciation percentage p resp. i are 
obtained from formula (2.12) for k — n as follows: 


B, 
p=100-i with i=1-—J—. 
Bo 


Notice that in the case of geometric-degressive depreciation, a final book value of B, = 0 is 
not possible. 
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Example 2.39 


A company buys a machine for 40,000 EUR and intends to use it for 8 years. 


The final book value Bg is 6,000 EUR. Value i is obtained as 


i=1 


| Bg | 6,000 
Pope 7: 0.211119161. 
Bo 40, 000 


We present a depreciation table for the case of geometric-degressive depreciation as Table 2.8. 
For instance, the values for the third year are obtained as follows: 


By = Во: (1 — i)? ~ 40,000 - 0.7888808382 = 24, 893.32 EUR, 


рз = Во: (1— i-i = 5,255.46 EUR, 


Вз = Ву. (1 — i)? = 19, 637.86 EUR. 


Table 2.8 Depreciation table for Example 2.39 
(geometric-degressive depreciation) 


Year Bookvalueat Depreciation Book value 
the beginning amount at the end 
k Br} Dk Bk 
1 40,000.00 8,444.77 31,555.23 
2 31,555.23 6,661.91 24,893.32 
3 24,893.32 5,255.46 19,637.86 
4 19,637.86 4,145.93 15,491.93 
5 15,491.93 3,270.64 12,221.29 
6 12,221.29 2,580.15 9,641.14 
7 9,641.14 2,035.43 7,605.71 
8 7,605.71 1,605.71 6,000.00 


EXERCISES 


2.1 (a) An arithmetic sequence {an} has a first term a; = 15 and difference d = 8. Find 


the term a101. 


(b) For an arithmetic sequence, the terms ag = 21 and ajọ = 25 are known. Find 


difference d and the terms a, and ay. 


2.2 The sequence {a,} has 


(a) 
(b) 
(c) 
2.3 (a) 


a = 3 + =, n € №. 


Is sequence {an} decreasing or increasing? 
Is (a4) a bounded sequence? 


Find a recursive formula for this sequence. 


Let {a,} be a geometric sequence with the ratio of successive terms q = —2/3 
and the term ау = 64/243. Find the first term ај. Which of the terms is the first 


with an absolute value less than 0.01? 
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2.4 


2.5 


2.6 


2.7 


2.8 


2.9 
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(b) A geometric sequence has the terms a? = 6 and a7 = 2/81. Find the first term 
a, and the ratio of successive terms q. 


Given are the sequences 


(and - 2-35. (on) = |" and tn = [5]. neN. 
n n n. 


Are these sequences monotone and bounded? Find the limits of the sequences if they 
exist. 


Find the limits of the following sequences if they exist (n € N). 


R; 


n an? 3 n? 
(a) {an} ls ms | , ( ) { n) I; $ =| д 
(c) {cy} with cy = c, 1/2. Check it for c; = 1 and c, = 4. 
A textile company starts in its first year with an output of 2,000 shirts and 1,000 


trousers. The production of shirts increases by 500 shirts every year, and the number 
of trousers rises by 20 per cent every year. 


(a) How many shirts and trousers are produced in the second, third and tenth years? 
(b) What is the total output of shirts and trousers after 15 years? (Use given formulas.) 


Find the limit s = lim s, for the following series {sn}: 
n— oo 


n n 


(—2)Ё+! 1 
(а) s=). 5k > (b) BD так 


k=1 k=1 


(Hint: find a formula for sn.) 
Check with respect to the real numbers x whether the following series {s,,} converge: 


n k n k2 


@ s=) vn 6 =). 


k=1 k=1 


Consider the following series {s,,} with: 
n 4k +2 n k2 
@ = Dv 1622. ® s= hy 


n k2 n 
k k, l 
©) s=} (g) : @ s= [C+ Zz]. 
К К=1 (z + 1 Í k=1 k 
Find the first four terms of the series and calculate the partial sums 51, 52,53 and sq. 
Check whether the series converge. 


Which is the best investment for a family wishing to invest 10,000 EUR for 10 years: 


(a) an interest rate of 6 per cent annually for 10 years; 

(b) an interest rate of 7 per cent for the first 5 years and 5 per cent for the next 5 years; 

(c) an interest rate of 7.5 per cent for the first 2 years, then 6 per cent for the next 
4 years and 5.5 per cent for the last 4 years? 


2.14 


2.15 


2.16 


2.17 


2.18 


2.19 
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A grandfather wants to give his grandson 100,000 EUR at his thirtieth birthday. How 
much does he need to invest on the grandson’s eighteenth birthday if the bank offers 
a rate of interest of 6 per cent p.a.? 


What interest rate will turn 2,000 EUR into 3,582 EUR after ten years? 
Tom invests 10,000 EUR. The bank offers a rate of interest of 5 per cent p.a. 


(a) Whatis the amount after 10 years when interest is compounded annually, quarterly 
and monthly, respectively? 
(b) Find the effective rate of interest Те for quarterly and monthly compounding. 


Ann has given 5,000 EUR to a bank on 20 October 2003, and she wants to get the 
money back on 20 April 2006. What is the amount at the end if the bank has credited 
interest of 3 per cent p.a. compounded annually? 


You decide to put away 200 EUR every month. Bank A offers simple interest of 6 per 
cent per annum, and bank B offers 0.48 per cent per month compounded monthly. 
After making 12 deposits, how much money would you have at the end of one year 
in banks A and B, respectively? 


(a) The deposits are made at the same time the interest is credited (i.e. at the end of 
each month). 

(b) The deposits are made at the beginning of each month and the interest is credited 
at the end. 


A man wants to buy a car for 24,000 EUR. He decides to pay a first instalment of 
25 per cent and the remainder with a bank loan under the following conditions. The 
borrowed sum will be paid back in one payment at the end of three years. During 
these 36 months interest is to be paid monthly at a per annum rate of 4.8 per cent. At 
the same time a sinking fund has to be set up in order to repay the loan. The man has 
to make monthly deposits into his sinking fund, which earns 3 per cent interest per 
annum compounded monthly. 


(a) Find the sinking fund deposit. 
(b) How much money does the man need each month to pay the sum of deposit and 
interest? 


What is the annual deposit required to pay out 10,000 EUR after five years? Interest 
of 5 per cent per annum is compounded annually. 


(a) Find the present value ofan annuity at arate of 5 per cent p.a. and with a withdrawal 
of 6,000 EUR at the end of each of 15 years. 

(b) How does the present value change if the withdrawal at the end of each month is 
500 EUR, assuming an ordinary annuity at 5 per cent p.a.? 


On 1 January 2015, Peter will retire. He has paid an instalment of 20,000 EUR on 
1 January 2000 and after that, he makes annual payments of 2,000 EUR every year 
up to 2014 on 1 January. The bank offers 5 per cent p.a. over the whole period. 


(a) What is the amount on 1 January 2015? 

(b) Beginning with 31 January 2015, Peter wants to make a monthly withdrawal of 
500 EUR. What is the present value of the annuity after ten years? Interest is 
compounded monthly at a rate of 5 per cent p.a. 
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2.20 


A family has raised a mortgage of 150,000 EUR on its house. It can amortize the 
balance at 8.5 per cent p.a. for ten years. 


(a) What are the annual payments? What is the total payment? Give a complete 
redemption table with the current amount of the loan at the beginning and at the 
end of each year, interest payment and amortization instalment for all the 10 years. 

(b) Find the total payment if the family repays 15,000 EUR plus the interest on the 
current amount of the loan every year (equal amortization instalments). 

(c) How many years does the family repay when the annual annuity is A = 18,000 
EUR? 


A project which is going over two years has the expected costs and returns at the end 


of year k (in EUR) given in the following table: 


Year Costs Returns 
k Ck Ry 


0 40,000 0 
1 5,000 18,000 
2 6,000 35,000 


(a) Let the rate of interest be i = 5 per cent p.a. Check by means of the method of 
net present value and by the method of internal rate of return whether the project 
should go ahead. 

(b) What are the percentage rates of interest (use only integers) for the project being 
advantageous? 

(c) In the third year, the costs for the project will be C4 = 6,000 EUR and the 
returns R3 = 36,000 EUR. Is it now profitable to implement the project with 
i = 5 per cent p.a.? 


A transport company has bought a bus for 100,000 EUR. After eight years, the bus is 
sold to another company for 44,000 EUR. Compare depreciation amounts and book 
values at the end of each of the eight years when the company uses 


(a) linear depreciation, 
(b) arithmetic-degressive depreciation, 
(c) geometric-degressive depreciation. 


3 Relations; mappings; functions of a 
real variable 


In this chapter, we deal with relations and a specification, namely mappings. Furthermore we 
introduce special mappings that are functions of one real variable, and we survey the most 
important types of functions of a real variable. 


3.1 RELATIONS 


A relation indicates a relationship between certain objects. A binary relation R from a set A 
to a set B assigns to each pair (a, b) € A x B one of the two propositions: 


(1) ais related to b by R, in symbols: aRb; 
(2) ais not related to b by R, in symbols: a Rb. 


Definition 3.1 A (binary) relation R from a set A to a set B is a subset of the Cartesian 
product A x B, i.e. R C A x B. 


We describe a relation R in one of the two following forms: (a,b) є R <> aRb or R = 
{(a,b) € Ax B | aRb}. If R is a relation from set A to set A, we also say that R is a relation 
on A. 


Example3.1 Let us consider a relation А С 2 х 2 on 2 as follows. An integer i є Z 
is related to integer j € Z by R if and only if they have the same remainder when divided 
by 5. This means that e.g. 7 is related to 17 by R (i.e. 7R17) since both numbers give the 
remainder 2 when dividing by 5. Similarly, 9 is related to 4 by R (for both numbers, we get the 
remainder 4 when dividing them by 5), while 6 is not related to 13 (since 6 gives remainder 1, 
but 13 gives remainder 3) and 0 is not related to 23 (since 0 gives remainder 0, but 23 gives 
remainder 3). Moreover, relation R has the following two properties. First, every integer a 
is related to itself (1.e. aRa, such a relation is called reflexive), and relation R is symmetric, 
i.e. it always follows from aRb that bRa. 
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Next, we give the notions of an inverse and a composite relation. 


Definition 3.2 Let R С A x B bea binary relation. Then the relation 
“l= {(b,a)€ B x A | (a,b) €E R} CB xA 


(read: R inverse) is the inverse relation of R. 


The following definition treats the case when it is possible to ‘concatenate’ two relations. 


Definition 3.3 Let 4, B, C be sets and R C A x B, S C B x C be binary relations. 
Then 


SoR={(a,c) € A x C | ћеге exists some b є B such that (a, b) € RA (b, с) є S} 


is called the composite relation or composition of R and S. 


The composition of two relations is illustrated in Figure 3.1. The element a € A is related to 
the element c € C by the composite relation S o R, if there exists (at least) one b € B such that 
simultaneously a is related to b by R and b is related to c by S. We emphasize that the sequence 
of the relations in the composition is important, since the relations are described by ordered 
pairs. Therefore, in general 5 o R zz Ro S may hold, and one can interpret Definition 3.3 in 
such a way that first relation R and then relation S are used. 


Figure 3.1 The composition S o R. 
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The following theorem gives a relationship between inverse and composite relations. 


THEOREM 3.1 Let A,B,C be sets and R C A x B and S С B x C be binary relations. Then 
(So R)! =R los, 


Theorem 3.1 is illustrated by the following example. 


Example 3.2 Consider the relations 
К= (х,у) ER |y=e} ad S={(,y) є |x-y <2}. 
We apply Definition 3.3 and obtain for the possible compositions of both relations 
SoR={(x,y) € R? | 3z eR wihz = е Az—y <2} 
= (у) Є |e —y < 2) 


= ((,у) є |у> е – 2) 
and 
RoS = {(x,y) є R? | az є R withx —z<2Ay=2} 
= {(x,y) em y>O0Az=InyAx-—Iny x 2} 
= {(x,y) є В? |y>O0AlIny 2 x—2) 


= (Gy) є В |y > e?) 


The graph of both composite relations is given in Figure 3.2. Moreover, we get 
(8 о №)! = {(у,х) є R |y > e – 2), 
(RoS) = ([0,х) € R? | y > e? 


We now illustrate the validity of Theorem 3.1. Using 
Е = [0,х) € R? |y=e%}, 
S| = {0,х) ER? |х-ух 2}, 


we obtain 
R`! oS! = {(y,x) € R? | az € R withz -y < 2A z = е} 
= (ox ER |e -y <2} 
= {(у,х) ER | у> еї — 2} 


and 
ST! oR! = {(у,х) € R? | 3z € R withy = её Ax =z < 2} 
= {(y,x) ER? | y>0Ax—Iny x 2) 
= {(y,x) ER? |y>0AlIny 2 x—2) 
= (09,39 eR? | y > е2). 
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(a) SoR (b) RoS 


Figure 3.2 The compositions S o R and R o $ in Example 3.2. 


3.2 MAPPINGS 


Next, we introduce one of the central concepts in mathematics. 


Definition 3.4 Let A, B be sets. A relation / С A x B, which assigns to every a € A 
exactly one b € B, is called a mapping (or a function) from set A into set В. We write 


f:A— B 
or, if the mapping is considered elementwise: 
аєАњ (а) —be B. 


The set A is called the domain of the mapping, and the set B is called the target. 
For each a € A, f (a) = b is called the image of a. The set of images of all elements of 
the domain is called the range f (A) С B of the mapping. 


A mapping associates with each element of A a unique element of B; the range f(A) may be 
a subset of B (see Figure 3.3). It follows from the above definition that a mapping is always 
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Domain A Target B 
i E Range 
s fA 
S tA 
e— | . 
7 w 


Figure 3.3 Mapping f : A > B. 


a relation, namely a particular relation R such that, for each a € A, there is exactly one pair 
(a, b) € A x B with (a,b) є R. On the other hand, not every relation constitutes a mapping, 
e.g. the relation 


В = ((х,у) є, xR|x =y} 


is not а mapping. Indeed, for each x є IR, \{0}, there exist two values y; = —./x and 
y2 = A/x such that (х,у) € R and (х,у) є R (see Figure 3.4). 


у, 


Y 


Figure 3.4 Relation R = (x, y) € Ry x R | x = 52). 
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Example 3.3 The German Automobile Association ADAC annually registers the 
breakdowns of most types of cars in Germany. For the year 2002, the following values 
(number of breakdowns per 10,000 cars of this type) for 10 types of medium-size cars with 
an age between four and six years have been obtained (see ADAC journal, May 2003): 
Toyota Avensis/Carina (TOY): 7.6, Mercedes SLK (MSLK): 9.1, BMW Z3 (BMWZ): 9.9, 
Mazda 626 (MAZ): 10.2, Mitsubishi Carisma (MIT): 12.0, Nissan Primera (NIS): 13.8, Audi 
A4/S4 (AUDI): 14.0, VW Passat (VW): 16.1, Mercedes C-class (MC): 17.5, BMW Series 
3 (BMW3): 17.8. We should emphasize that we have presented the results only for the ten 
most reliable cars in this report among the medium-size cars. 

Assigning to each car type the rank (i.e. the most reliable car gets rank one, the car 
in second place gets rank two and so on), we get a mapping f : Dy — N with 
Dy = (TOY, MSLK, BMWZ, MAZ, MIT, NIS, AUDI, VW, MC, BMW3}. Mapping f is given 
byf : (TOY, D, (МЅІК, 2), (BMWZ,3), (MAZ, 4), (МІТ, 5), (NIS, 6), (AUDI, 7), (VW, 8), 
(MC, 9), (BMW3, 10)}. 


We continue with some properties of mappings. 


Definition 3.5 А mapping f from A into В is called surjective if f(A) = B, i.e. for 
each b c B, there exists (at least) an a € A such that f(a) = b. 


A surjective mapping is also called an onto-mapping. This means that every element in set 
B is the image of some element(s) in set A. 


Definition 3.6 А mapping f from A into В is called injective if for all a1, a2 € A the 
following implication holds: 


aj Ф a2 = f (a1) £ f (a2) 


(or equivalently, f (a1) = f (a2) > a1 = аз). 


This means that no two elements of set A are assigned to the same element of set B. 


Definition 3.7 A mapping f from A into B is called bijective if f is surjective and 
injective. 


A bijective mapping is also called a one-to-one mapping. The notions of a surjective, an 
injective and a bijective mapping are illustrated in Figure 3.5. 
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(a) Relation f is (b) An injective but not 
not a mapping surjective mapping g 


(c) A surjective but not (d) A bijective mapping k 
injective mapping h 


Figure 3.5 Relations and different types of mappings. 


Example 3.4 Consider the following mappings: 
f:R В with f(x) = х2 
NN with g(x) = х? 


R — В, with h(x) = x? 


9: 
h: 
k:R, > R} with k(x) =x? 

Although we have the same rule for assigning the image f(x) to some element x for all 


four mappings, we get different properties of the mappings f, g, h and k. Mapping f is not 
injective since e.g. for x; = —1 and x? = 1, we get f (x1) = f (x2) = 1. Mapping f is also 
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not surjective since not every y є R is the image of some x € Dy = R. For instance, for 
у = —1, there does not exist an x є R such that x? = y. 

Considering mapping g, we see that g is injective. For any two different integers x; and x2, 
obviously the squares x? and xi are different, i.e. no two different natural numbers have the 
same image. However, mapping g is still not surjective since only squares of natural numbers 
occur as image f(x). For instance, for y = 2, there does not exist a natural number x such 
that x? — y. 

Mapping л is surjective since for each y є R+, there exists (at least) one x є R such that 
f (x) = y, namely for x, = „/у (and for x? = — Jy, respectively), we get 


Јод) = С)? =y (хә) = (У)? = у). 


Mapping Ё is not injective for the same reason that mapping f was not injective. Considering 
now mapping k, it is injective since any two different non-negative real numbers x, and x2 
have different squares: x? x ee Mapping Ё is also surjective since for any y є Кү, there 
exists an х = ,/y > 0, such that 


fO=f6 9) Qn = у. 


Therefore, mapping k is bijective. 


Next, we define an inverse mapping and a composite mapping. 


Definition 3.8 Let f : А — B bea bijective mapping. Then the mapping 
ева 


(read: f inverse) which assigns to апу b є В that a є А with b = f(a), is called the 
inverse mapping of mapping f. 


Definition 3.9 Letf : 4 — B and g : C — D be mappings with f(A) С C. Then 
the mapping 


gof:A— D, 


which assigns to any a € A a unique image (g o f)(a) = g(f(a)) € D, is called the 
composite mapping or composition of f and g. 


The above definition reinforces the concept of a composite relation for the case of mappings. 
This means that for some a € Dy, mapping f is applied first, to give the image f(a). This 
image is assumed to be an element of the domain of mapping g, and we apply mapping g to 
f(a), yielding the image g (f (a)). It is worth emphasizing that, analogously to compositions 
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of relations, the order of the sequence of the mappings in the notation g o f is important, i.e. 
‘g is applied after f". 
Example3.5 Consider the mappings f : 4 — A given by 
(1,3), (2, 5), (3, 3), (4, 1), (5, 2)} 
and g : A — A given by 
(1,4), 2, 1), G, 1), (4, 2), (5, 3)]. 
For the ranges of the mappings we get 
f (A) = {1,2,3,5} and g(A) = {1, 2, 3, 4}. 
We consider the composite mappings f о g and g o f and obtain 


(о 4)(1) = 70901) = 704) = 1, (fog)2 —f( 23, (fog)3) =f) =3, 


(fog)4) = 70) = 5, (fog) 2/3) = 3, 
(gof)1) = 9001) = 9(3) = 1, (gof)2) = 9(5) = 3, (gof)3 = #8) = 1, 
(9 of)(4) =g) = 4, (g of)(5) = gQ) = 1. 


The composition g o f is illustrated in Figure 3.6. 


5 


Figure 3.6 The composite mapping g o f in Example 3.5. 


For composite mappings g o f, we have the following properties. 
THEOREM 3.2 Letf : A — Band g : B — C be mappings. Then: 
(1) If mappings f and g are surjective, mapping g o f is surjective. 


(2) If mappings f and g are injective, mapping g o f is injective. 
(3) If mappings f and g are bijective, mapping g o f is bijective. 
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Example 3.6 Consider the following mappings: 


/:К— R with f(x) = 3х + 5; 
а:К— К with ga) =e +1. 


First, we show that both mappings f and g are injective. Indeed, for mapping f, it follows 
from х Æ x2 that 3x, + 5 Æ 3x2 + 5. Analogously, for mapping g, it follows from x; 4 x2 
that e*! + 1 A e? + 1. Then mapping g o f with 


(g of )(x) E g(f (x)) = g(3x + 5) = ge +1 


is injective as well due to part (2) of Theorem 3.2. 


We can formulate a similar property for mappings as given in Theorem 3.1 for relations. 


THEOREM 3.3 Let f : A — B and g : B — C be bijective mappings. Then the inverse 
mapping (f o 9)! exists, and we get 


(pog) =g of. 


Finally, we introduce an identical mapping as follows. 


Definition 3.10 A mapping J : A — A with I (a) = a is called an identical mapping. 


If for a mapping f : A — B the inverse mapping f7! : B — A exists, then 
fof =f of =1, 
where / is the identical mapping. This means that for each a € A, we get 


ff Ma) =f FA) = a. 


3.3 FUNCTIONS OF A REAL VARIABLE 


In this section, we consider mappings or functions f : A — B with special sets A and B, 
namely both sets are either equal to the whole set IR of real numbers or equal to a subset 
of R. 
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3.3.1 Basic notions 


Definition 3.11 A mapping f that assigns to any real number x є Dy € R a unique 
real number y is called a function of a real variable. The set Dy is called the domain 
of function f , and the set 


Ку =f (De) = ty ly =f), x € Ру} 


is called the range of function f. 


We write y = Р(х), х € Dy. Variable x is called the independent variable or argument, and y 
is called the dependent variable. The real number y is the image or the function value of x, 
i.e. the value of function f at point x. The domain and range of a function are illustrated in 
Figure 3.7. 


y 
n 
dopo ie уе UD. 
| 
| | 
Ву [c,d] i 
| | 
| 1 
"e | 
— - 
a D;= [a,b] b x 


Figure 3.7 Domain and range of a function f : Dr — Ку. 
We also write 
f: Drs >R or f : Df — Ry. 
Alternatively we can also use the notation 
fiio) у= о), x € De}. 
Hereafter, the symbol f denotes the function, and f (x) denotes the function value of a certain 


real number x belonging to the domain Dy of function f. To indicate that function f depends 
on one variable x, one can also write f = f(x) without causing confusion. 


A function f : Dy — Ry can be represented analytically, graphically or in special cases by 
a table presenting for each point x € Dy the corresponding function value y — f(x) € Ry. 
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We can use the following analytical representations: 
(1) explicit representation: y = f (x); 


(2) implicit representation: F (x,y) = 0; 
(3) parametric representation: x = g(t), y=h(t); єр, = Dh. 


In the latter representation, t is called a parameter which may take certain real values. For 
each value t € Р, = Dj, we get a point (х,у) with y = f (x). 


Example3.7 Let function f : IR — IR be given in explicit form by 
у= (х) = х 43x42. 
The corresponding implicit representation is as follows: 


F(x,y) = х? + 3х+2—у = 0. 


We emphasize that, from a given implicit or parametric representation, it is not immediately 
clear whether it is indeed a function (or only a relation). 


Example 3.8 Consider the parametric representation 

x=rsint, y=rcost; t € [0,27]. (3.1) 
Taking the square of both equations and summing up, we get 

x +y = r° . (sin? t + cos? t) = r? 
which describes a circle with radius r and point (0,0) as centre. Each point x € (—r,r) 


is related to two values y; = Vr? — x? and у; = —– ур? — х2. Therefore equation (3.1) 
characterizes only a relation. 


The concept of bijective mappings can be immediately applied to a function of a real variable 
and we obtain: if a function f : Dr > Ку, Dy С К, is bijective, then there exists a function 
f^! which assigns to each real number y € Ry a unique real value x € Dy with 


peer 


Here it should be mentioned that, if Ау С R but Ry Æ R (in this case we say that Ry is a 
proper subset of IR), the inverse function of f : Dy — IR would not exist, since f would 
not be surjective in that case. Nevertheless, such a formulation is often used and, in order 
to determine f—', one must find the range Ry (in order to have a surjective mapping) or, 
equivalently, the domain Ру: of the inverse function of f. 
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We write f^! : Ry — R or Rf — Dy. Alternatively, we can use the notation 


f 2(9,31»2fG.x € Ру}. 


Since x usually denotes the independent variable, we can exchange variables x and y and 
write 


y=f Œ) 


for the inverse function of function f with у = f (x). The graph of the inverse function f~! 
is given by the mirror image of the graph of function f with respect to the line y = x. The 
graphs of both functions are illustrated in Figure 3.8. (In this case, function f is a linear 
function and the inverse function f^! is linear as well.) 


Figure 3.8 Graphs of functions f and f me 


Example 3.9 Let function f : Ry — [4, оо) with 
у=/0) =x +4 
be given. To determine the inverse function f ^! , we solve the above equation for x and obtain 


x —y—4 
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which yields 


Interchanging both variables, we obtain the inverse function 
y-f Q)-Xx-4. 


The domain of the inverse function f^! is the range of function f, and the range of 
function f—! is the domain of function f, i.e. 


Dy-1 = [4, оо) and Кул = R+. 


Analogously to our previous considerations in this chapter, we define composite functions 
of a real variable 


gof :Df > Ҝ 


with y = g(f(x)),x € Dy. Function f is called the inside function and function g is called 
the outside function. If there exists the inverse function f~! for a function f, we have 


flof=fof =1, 


Оу 0), 


where J is the identity function. We illustrate the concept of a composite function by the 
following examples. 


Example 3.10 Let functions f : R —^ Rand g :R— R with 
f(x) = 3x-2 and g(x) =х?-єх—1 
be given. We determine both composite functions f o g and g o f and obtain 
(о 4) 00) 2f(gG)) =f +x4 1) 2392 -x-1) -22 32 +3x—5 
and 
(д of)6) = g(f (69) = gx — 2) = (3х — 2)? + Bx - 2) - 1 = 9x? — 9x + 1. 
Both compositions are defined, since the range of the inside function is either set R 


(function f) or a subset (function g) while in both cases the outside function is defined 
for all real numbers. 
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Example 3.11 Given are the functions f : Dy > R and g : Dy > R with 
f@)=vVx+1, Dy-[-Loo) 
and 
g(x) = sin 3x, D, =R. 
For the composite functions we obtain 
(f o g)G) = f (gG)) =f (sin 3x) = vsin 3x + 1 
and 
(д of (x) = g(f 6) = g( Vx + 1) = sinGVx + 1). 


Composition f o g is defined since the range Ry = [—1, 1] is a subset of the domain Dy, and 
composition g o f is defined since the range Ry = [0, оо) is a subset of the domain Dj. 


3.3.2 Properties of functions 


Next, we discuss some properties of functions. We start with an increasing and a decreasing 
function. 


Definition 3.12 A function f : Dy — R is increasing (resp. strictly increasing) on 
an interval I С Dy if, for any choice of x; and x? in J with x; < x2, inequality 


SF Ол) < (хә) (resp. f (x1) < f (x2)) 
holds. 


Obviously, a strictly increasing function is a special case of an increasing function. The latter 
is also called a non-decreasing function in order to emphasize that there may be one or several 
subinterval(s) of J, where function f is constant. 


Definition 3.13 A function f : Dy — IR is decreasing (resp. strictly decreasing) on 
an interval I С Dy if, for any choice of x; and x2 in J with x; < x2, inequality 


fœ) = оо) (resp. f (x1) > f (2)) 
holds. 


A decreasing function is also called a non-increasing function. An illustration is given in 
Figure 3.9, where function f is strictly increasing on the interval /, but strictly decreasing 
on the interval Г. 
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f strictly f strictly 
increasing on J; decreasing on Г 


1, ГА z 
Figure 3.9 Monotonicity of function f : Dy > R. 
We note that a function f : Dy — Wy, which is strictly monotone on the domain (i.e. either 


strictly increasing on Dy or strictly decreasing on Dy, is bijective and therefore, the inverse 
function f—! : Wy — Dy exists in this case and is strictly monotone, too. 


Example 3.12 We consider function f : Dy — R with 


2 
fx) = 


This function is defined for all x 4 —1, i.e. Df = R \ (—1]. First, we consider the interval 
Iı = (—1, оо). Let xi, x2 € Д with x < x2. We get 0 < x1 + 1 < x2 + 1 and thus 


2 


> —— =f (x). 


UE arg EE 


Therefore, function f is strictly decreasing on the interval J). 
Consider now the interval Jy = (—оо, —1) and let x; < x2 < —1. In this case, we first get 
xı +1 « x2 +1 < 0 апа then 


fœ) = 


LR = А 
ЖЕТ ют a 


Therefore, function f is strictly increasing оп the interval (—oo,—1). The graph of 
function f is given in Figure 3.10. 
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Figure 3.10 Graph of function f in Example 3.12. 


Definition 3.14 A function f : Dy — R is said to be bounded from below (from 
above) if there exists a constant C such that 


ДО) > С (resp. f (x) < С) 


for all x € Dy. Function f is said to be bounded if f(x) is bounded from below and 
from above, i.e. 


Ifc) < С 


for all x € Dy. 


Example 3.13 We consider function f : IR > R with 
у=) = е – 2. 


Function f is bounded from below since f(x) > —2 for all х є К. However, function f 
is not bounded from above since f (x) can become arbitrarily large when x becomes large. 
Therefore, function f is not bounded. 
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Example 3.14 We consider function f : IR > R with 
y =f (x) = 3 + sin 2x. 


First, the function values of sin 2x are in the interval [—1, 1]. Therefore, the values of function 
f are in the interval [2,4]. Consequently, for C = 4 we get |f(x)| < C for allx € R and thus, 
function f is bounded. 


Definition 3.15 А function f : Df — R is called even (or symmetric) if 


f(-x) = 0). 


Function f is called odd (or antisymmetric) if 


je) = 7 @). 


In both cases, the domain Dy has to be symmetric with respect to the origin of the 
coordinate system. 


An even function is symmetric to the y-axis. An odd function is symmetric with respect to 
the origin of the coordinate system as a point. It is worth noting that a function f is not 
necessarily either even or odd. 


Example3.15 We consider function f : IR \ {0} —> R with 
3 1 
f(x) = 40° — 2х + -. 
X 
We determine f (—x) and obtain 


jeu коол е. 
= - 


= (e E f (x) 
= JL ; 


Thus, function f is an odd function. 


Example 3.16 Let function f : К — R with 


ДО) = 3х5 + x? 
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be given. Then we obtain 


Р(х) = 3(—x)® + (—x)? = 3x6 +x? = f(x). 


Hence, function f is an even function. 


Definition 3.16 A function f : Dy — R is called periodic if there exists a number T 
such that for all x є Dy with x+ T € Dy, equality f (x + T) = f(x) holds. The smallest 
real number Т with the above property is called the period of function f . 


Definition 3.17 А function f : Dy — R is called convex (concave) on an interval 
I € Dy if, for any choice of x; and x? in Z and 0 < А < 1, inequality 


Озу + (= Аз) < Уба) + (1 — Jf Ge) (32) 
(тезр./ бла + (1 Ао) > Аба) + @ — 9f 62)) өз) 
holds. 


If for 0 < à < 1 and x; Æ x2, the sign < holds in inequality (3.2) (resp. the sign > in 
inequality (3.3)), function f is called strictly convex (strictly concave). 


Ax; + (1 — А)» 
А € [0,1 


Figure 3.11 Illustration of a convex function f : Dy > К. 
The definition of a convex function is illustrated in Figure 3.11. Function f is convex on an 


interval J if for any choice of two points x; and x2 from the interval and for any intermediate 
point x from the interval [x1, x2], which can be written as x = Ax; + (1 — A)x», the function 
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value of this point (i.e. f (Ax1 + (1 — A)x2)) is not greater than the function value of the 
straight line through the points (x1, /(х1)) and (x2,f(x2)) at the intermediate point x. The 
latter function value can be written as Af (x1) + (1 — А) (x2). 


Checking whether a function is convex or concave by application of Definition 3.17 can be 
rather difficult. In Chapter 4, we discuss criteria for applying differential calculus, which are 
easier to use. Here we consider, for illustration only, the following simple example. 


Example 3.17 Let function f : К > R with 
f(x) =ax+b 


be given. Using formula (3.2) of the definition of a convex function, we obtain the following 
equivalences: 


Хх + (1 — Аю) 
e a-[Axyt+(1—A)m] +b 
«= алх + а(1 — à)x2 + b 
< 0 


Af (x1) + (1 — ADS (2) 

à - (axi +b) + (1 — A) - (axo + b) 
Аах + Ab + (1 — Аах + b — Ab 
0. 


= 
= 
= 
= 


Since the latter inequality is obviously true, the first of the above inequalities is satisfied too 
and hence, function f is convex. We mention that function f is also concave (all inequality 
signs above can be replaced by the sign >). So a linear function is obviously both convex 
and concave, but it is neither strictly convex nor strictly concave. 


3.3.3 Elementary types of functions 


Next, we briefly discuss some types of functions and summarize their main important 
properties. We start with polynomials and some of their special cases. 


Polynomials and rational functions 


Definition 3.18 The function P, : К — R with 
y = P Œ) = anx” а Е + аэх? + ax + ау 


with a, 4 0 is called a polynomial function (or polynomial) of degree n. The numbers 
a0, 41, . .., an are called the coefficients of the polynomial. 


Depending on the value of n which specifies the highest occurring power of x, we mention 
the following subclasses of polynomial functions: 


n=0: y= Ро(х) = ао constant function 
п=1: у= Р(х) = ах + ao linear function 
п=2: у= Р(х) = ах? + aix + ао quadratic function 


п= 3: y=P3(x)= азх? + ax? + ах + ag cubic function 


We illustrate the special cases of n = 1 andn = 2. 
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Casen = 1 In this case, the graph is a (straight) Jine which is uniquely defined by any two 
distinct points P, and P» on this line. Assume that P, has the coordinates (x1, y1) and P2 has 
the coordinates (x2, y2), then the above parameter a, is given by 


_ 2 = У 
0 — ХІ 


а] 


and is called the slope of ће line. The parameter ao gives the y-coordinate of the intersection 
of the line with the y-axis. These considerations are illustrated in Figure 3.12 for ао > 0 and 
a, > 0. The point-slope formula of a line passing through point P, with coordinates (x1, y1) 
and having a slope aj is given by 


y-yr-aj:(x—x). 


Figure 3.12 А line with positive slope а]. 


Example 3.18 Assume that for some product, the quantity of demand D depends on the 
price p as follows: 


1 
D =f (p) = 1,600 — 5 -p. 


The above equation describes the relationship between the price p of a product and the 
resulting demand D, i.e. it describes the amount of the product that customers are willing 
to purchase at different prices. The price p is the independent variable and demand D is the 
dependent variable. We say that f is the demand function. Since both price and demand can 
be assumed to be non-negative, we get the domain and range as follows: 


Dp = {(p €R|0<p<3,200) and Ry ={DER|0<D< 1,600}. 


If we want to give the price in dependence on the demand, we have to find the inverse 
function f—! by solving the above equation for p. We obtain 


1 
—.р=1,600—Ю 
3 
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which corresponds to 
р= D) = 2. (1,600 — D) = 3,200 — 2D. 


Here we did not interchange the variables since ‘D’ stands for demand and ‘p’ stands for 
price. Obviously, we get Dy-1 = Ry and Ry-1 = Dy. 


Case n = 2 The graph of a quadratic function is called a parabola. If az > 0, then the 
parabola is open from above (upward parabola) while, if a? < 0, then the parabola is open 
from below (downward parabola). A quadratic function with a? > 0 is a strictly convex 
function while a quadratic function with a? < 0 is a strictly concave function. The case of a 
downward parabola is illustrated in Figure 3.13. Point P is called the vertex (or apex), and 
its coordinates can be determined by rewriting the quadratic function in the following form: 


у= уо = a: (x — x0), (3.4) 


where (xo, yo) are the coordinates of point P. We get 


2 
ai aj 
xo = —— and == +а 
2а2 70 4а» 0 
Y 4 


а <0 


Figure 3.13 A downward parabola y = ap - (x — ход)? + yo. 


Points x with у = f(x) = a2x? + а\х + ао = 0 are called the zeroes or roots of the quadratic 
function. In the case of a > 4a2a0, a quadratic function has the two real zeroes 


—aj +, Ja — 4a2a0 


2a» 


X12 = 


(3.5) 


In the case of a = 4a2a9, we get a zero x, = x2 occurring twice. In the case of a < 4a2a0, 
there exist two zeroes which are both complex numbers. If a quadratic function is given in 


Relations; mappings; functions 129 


normal form y = x* + px + q, formula (3.5) for finding the zeroes simplifies to 


xi 2 -E + JP. -4 
б PEE р 


Example 3.19 Consider the quadratic function P? : R > R with 
y = Po(x) = 2x? — 4х +4. 
First, we write the right-hand side as a complete square and obtain 
y=2-@? -2x +2) = 2. (& 1? +2, 
which can be rewritten as 
у-2=2.(х— 1)?. 
From the latter representation we see that the graph of function / is an upward parabola with 
the apex P = (1,2) (see equation (3.4)). We investigate whether the inverse function f~! 


exists. We try to eliminate variable x and obtain 


yz 2 
—__=(x- 1). 
5 (x — 1) 


If we now take the square root on both sides, we obtain two values on the right-hand 
side, i.e. 


— — = +(x - 1) 


which yields the following two terms: 


ly —2 —2 
х= 1+ m and х=1— == 


Therefore, the inverse function f7! does not exist since the given function f is not injective. 


If we restrict the domain of function f to the interval Dy = [1, 00), then the inverse function 
would exist since in this case, only solution x; is obtained when eliminating variable x. Thus, 
we can now interchange both variables and obtain the inverse function f~! with 


yaft@ait Z, Da = 200). 


The graph of function f and the graph of the inverse function f^! for the case when the 
domain of function f is restricted to Dy = [1, 00) are given in Figure 3.14. 
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Figure 3.14 Graphs of functions f and f -lin Example 3.19. 


Next, we consider the general polynomial P; of degree n with 


Р(х) = anx" + ay 1x" | +- + ajx + ao, 


а, + 0. 


In the following, we give two properties of general polynomials and start with the fundamental 
theorem of algebra. 


THEOREM 3.4 (fundamental theorem of algebra) Any polynomial P,, of degree n can be 
written as a product of polynomials of first or second degree. 


Another important property of polynomials is given in the following remainder theorem. Let 
deg P denote the degree of polynomial P. 


THEOREM 3.5 (remainder theorem) Let P : R — R and О: К — К be polynomials 
with deg P > deg О. Then there always exist unique polynomials S : К —> R and R : К —> К 
such that 

P=S-Q+R, 


(3.6) 
where polynomial R is called the remainder and deg R < deg Q. 
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If R(x) = 0, then we say either that polynomial Q is a factor of polynomial P or that 
polynomial P is divisible by polynomial Q. 


Consider the special case Q(x) — x — xo, i.e. polynomial Q is a linear function and we have 
deg Q(x) = 1. Then it follows from Theorem 3.5 that deg R = 0, i.e. R(x) = r is constant. 
This means that we can write polynomial P in the form 


P(x) = S(x) - (x — xo) +r. 


Consequently, for x = xo we get the function value Р(хо) = r. Furthermore, we have the 
following equivalence: 


Р(х) = S(x) : (x — xo) €» P(xo) = 0. (3.7) 


We can summarize the above considerations as follows. If xo is a zero of a polynomial P; 
with deg P, = n (i.e. a root of the equation P(x) = 0), then, according to the comments 
above, this polynomial can be written as 


Р(х) = (x — xo) - Sn-1 Œ), 


where $„—1 is a polynomial with deg 5, = n — 1. On the one hand, it follows from 
equivalence (3.7) that polynomial P, can have at most n different zeroes. On the other hand, 
it follows from Theorem 3.4 that polynomial P, has exactly n (real or complex) zeroes. 
If the polynomial P, has the complex zero x = a+ bi, then the complex conjugate x = a — bi 
is a zero of polynomial P,, too. Therefore, polynomial P, has the factors 


[x - @+ bi] -[x- a- n) = x? — 2ax + (à? +0), 


which is a polynomial of degree two. So we can confirm Theorem 3.4. A real zero хо leads 
to a polynomial x — хо of first degree and complex zeroes a + bi and a — bi to a polynomial 
x? — 2ax + (a? + b?) of second degree in the product representation of polynomial Р,. As a 
consequence of the above considerations, we can conclude for example that a polynomial Р» 
of degree three has either three real zeroes or only one real zero (and two complex zeroes). 


It may happen that certain factors x — x; occur repeatedly in the above representation. This 
leads to the definition of the multiplicity of a zero. 


Definition 3.19 Let f = P, : R — R bea polynomial of degree n with 
Р(х) = (Œ — xo)" Spex) апай Sy) #0. 


Then хо is called a zero (or root) of multiplicity k of polynomial Py. 


If xo is a zero of odd multiplicity, then the sign of function f changes ‘at’ point xo, 
i.e. there exists an interval about xo such that function f has positive values to the left of xo and 
negative values to the right of xo or vice versa. In contrast, if xo is a zero of even multiplicity, 
there is an interval about xo such that the sign of function f is the same on both sides of 
point хо. 


Next, we investigate the question of how to compute the function value of a polynomial 
easily. To this end, we introduce Horner 5 scheme. 
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Calculation of function values of polynomials using Horner $ scheme 


1 


Р„(хо) = anxo + an-1x0 ++ aX + a1xo + ао 


= {+++ [(anxo + An—1) xo + An—2] Xo + +++ + a1] xo + ao 
a 


The above computations can be easily performed using the following scheme and starting 
from the left. 


Horner s scheme 


аһ An—| An_2 5 42 А\ Ао = Р,„(хо) 


The advantage of Horner’s scheme is that, in order to compute the function value of a 
polynomial, only additions and multiplications have to be performed, but it is not necessary 
to compute powers. 


Example 3.20 We consider the polynomial P5 : R > R with 
Ps(x) = 2x5 + 4x4 + 8x? — Ax? — 10x. 


Since ao = 0, we get the zero хо = 0 and dividing P4(x) by x — xo = x yields the polynomial 
P4 with 


P4(x) = 2x* + Ax? + 8x? — 4x — 10. 


Checking now the function values for x = 1 and x3 = —1 successively by Horner’s scheme, 
we obtain 
2 4 8 —4 —10 
x=x =l 2 6 14 10 
6 
2 
4 


In the above scheme, we have dropped the arrows and the plus signs for simplicity. We have 
found that both values x? = 1 and x3 = —1 are zeroes of the polynomial Рл (х). From the last 
row we see that the quadratic equation 2x? + 4x + 10 = 0 has to be considered to determine 
the remaining zeroes of polynomial P4. We obtain 


x45 =—l + 1—5 
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which yields the complex zeroes 
x4 — —1-42i and х5 = —l — 2i. 


So we have found all five (real and complex) zeroes of polynomial Р; (х), and we get the 
factorized polynomial 


Ps(x) = 2x(x — (х + Do? + 2х + 5). 


Example 3.21 Ме consider the polynomial P5 : К > R with 
Ps(x) = х? — 5x4 + 40x” — 80x + 48 


and determine the multiplicity of the (guessed) zero x = 2. We obtain: 


1 —5 0 40 —80 48 
х=2 2 -6 —12 56 —48 
1 —-3 -6 28 —24 0 
X= 2 2 —2 —16 24 
1 —-1 -8 12 0 
x-—2 2 2 —12 
1 1 —6 0 
x22 2 6 
1 3 0 


Thus, x = 2 is a zero of multiplicity four, and the polynomial Р; can be written as a product 
as follows: 


Ps(x) = x —2)*- (x +3). 


The last factor is obtained from the remaining polynomial Sı with S1 (x) = x + 3 which has 
the zero x = —3. 


In general, it is difficult to find all zeroes of a polynomial of a higher degree. In Chapter 4, we 
discuss numerical procedures for finding the zeroes approximately. The following theorem 
gives some relationships between the coefficients of a polynomial and the zeroes. 


THEOREM 3.6 (Vieta’s theorem) Let polynomial P, : К > R with 


1 


Р(х) = 1-x" + a, 1x" | ++ + ax? + aix + ao 
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be given and let х], х2, · · · , x, be the zeroes of polynomial P,,. Then: 
X| c X2 + + Xp = —An-1 
x1X2 + X1X3 + +++ d Xn—1Xn = ап—2 
x1X2X3 + Х1Х2Х4 ++ + Xn—2Xn—1Xn = —an-3 
X]X2X3 +++ Xn = (-1)"ao 


In the special case of a quadratic function Р›(х) = x? + px + q, Vieta's theorem yields the 
two equations 


xı +2 = -—p and xX = q. 


Example 3.22 Let the polynomial P3 : R > R with 
P3(x) = х? — 2x” — 5x +6 


be given. When looking for integer zeroes, it follows from the last equality of Theorem 3.6 
that these zeroes must be a divisor of ay = 6. Therefore, only x = +1, x = +2 and x = +3 
are candidates for an integer zero. Checking the function values of P3 at these points e.g. by 
Horner’s scheme, we obtain the zeroes 


X] = 1, x2 = —2 and хз —3 
and thus the factorized polynomial 


Рз = (х— 1): (х+2). (хХ—3). 


Finally, we consider the quotient of two polynomials, Р апа О. 


Definition 3.20 A function T : Dr — R with T(x) = P(x)/Q(x) = (P/Q) (x) is 
called a rational function. The rational function T is called proper if deg P < deg O 
and improper if deg P > deg Q. 


The domain Dr of function T is given by all real numbers for which the polynomial in the 
denominator is different from zero, i.e. Dr = {x € R | O(x) z 0}. 


Example 3.23 Let the cost C of producing x units of a product be equal to 
C(x) = ax? + bx + с, х > 0. 


Here c > 0 describes the fixed cost of production, and а and b are real parameters such that 
C is a non-negative and strictly increasing function. Then the average cost C; (x), measuring 
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the cost per unit produced, given by 


ax? + bx +c 


x 


Са(х) = 
is an improper rational function which can be rewritten as 


Са(х) = ax + b + T 
x 


By means of polynomial division, any improper rational function can be written as the sum 
of a polynomial and a proper rational function. If deg P 7 deg Q, then equation (3.6) can be 
written as 


P(x) R(x) 
DONE eu DEVE 
Qu) ДЕТ; 


provided that x is not a zero of polynomial Q. To get the latter representation, we consider 
both polynomials P and Q in decreasing order oftheir powers and divide in each step the first 
term of P by the first term of Q. The result is multiplied by Q and the product is then subtracted 
from P. This yields a new polynomial P, having a smaller degree than polynomial Р„. Now 
the first term (with the largest exponent) of polynomial Р! is divided by the first term of 
polynomial О, the resulting term is multiplied by polynomial О and subtracted from P1, 
yielding polynomial Р» and so on. The procedure stops if some resulting polynomial P; has 
a smaller degree than polynomial Q. This procedure is illustrated by the following example. 


Example 3.24 Let polynomials Р: R > R and Q : R — R with 
Р(х) 22x — 3x3 + 2x? — x +4 and О(х) =x* —x+3 


be given. The function T : Dr — IR with T(x) = (P/Q)(x) is improper, and by polynomial 
division we obtain: 


4 3 2 
ao MAN 2 E D : @ —x+43)= 2? -x— 54 D 
—(2x*— 2x°+ бх) 
—x)— 42- x+ 4 
—(-3+ x-3x 
—5x2 + 2x+ 4 
— (—5x* + 5x—15) 
— 3x+ 19 


In this case, we have written function T as the sum of the polynomial S of degree two with 
S(x) = 2x? — x — 5 and the proper rational function А/О with 


(R/Q)(x) = (—3x + 19)/(x? — x + 3). 
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Power functions 


Definition 3.21 The function f : Dy — R with 
Ur гЄ К, 


is called a power function. In dependence on the value of r, ће domain D; and the 
range Ry are given as follows: 


(1) re (,3,5,...] © N Dy = (—00,00), Ку = (—00, оо); 
(2) r € (2,4,6,...] € N: Dy = (—00,0o), Ry = [0, со); 
(3) r € RAN : Dy = (0, оо), Ку = (0, оо). 


In case (1), function f is strictly increasing, unbounded and odd. Function f is strictly concave 
on the interval (—oo, 0] and strictly convex on the interval [0, оо). In case (2), function f is 
strictly decreasing on the interval (— оо, 0], strictly increasing on the interval [0, оо), bounded 
from below, even and strictly convex. Since function f is not injective in this case (note that 
f (—x) = —f (x)), the inverse function does not exist in this case (if one restricts the domain 
to the interval [0, оо), then function f is bijective and the inverse function exists). In case (3) 
with r > 0, function f is strictly increasing and bounded from below. Moreover, ifr > 1, 
function f is strictly convex and, if 0 < r < 1, function f is strictly concave. Case (3) 
with positive value r includes as a special case so-called root functions, where r — 1/n with 
nceN,ie. 


f@ =x" = x. 


In the case of a root function, number zero belongs to the domain and range of f, i.e. 
Dy = Ry = [0, оо). In case (3) with r < 0, function f is strictly decreasing, bounded from 
below and strictly convex. 


Obviously, the inverse function f7! of a power function f with f(x) = х” exists and is 
equal to y = x!/". The graph of some power functions for special values of r are given in 
Figure 3.15. If r € Q, we say that power function f is an algebraic function. 


Example 3.25 Let function f : (0,00) — (0, оо) with 
Ја) 2x9 


be given. Itis a power function with the rational exponent r — 2.5. This function is bijective, 
and so the inverse function exists. Solving for variable x, we obtain 


х= у1/25 = y04. 


Exchanging both variables x and y, we get the inverse function f^! with 


yox Dy-1 = Ry = (0, оо). 
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Figure 3.15 Graphs of some power functions. 


Exponential and logarithmic functions 


Exponential and logarithmic functions play an important role in many economic applications, 
e.g. growth and decline processes. 
Definition 3.22 The function f : IR — (0,00) with 
fœ =a, а> 0, а#1, 


is called an exponential function. 
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For all positive values of a, the graph of function f goes through point (0, 1). The graph of 
the exponential function for three values of a is given in Figure 3.16. For all values of a, 
the exponential function is only bounded from below (and therefore unbounded). If a — 1, 
the exponential function is strictly increasing, while for 0 < a < 1 the exponential function 
is strictly decreasing. All exponential functions are strictly convex. We note that for an 
exponential function with base e, the notation y — e* is sometimes also written as y — exp(x). 


Since an exponential function f is bijective for all values a > 0 with a 5 1 on the domain 
Dy, for each of these values of a the inverse function exists. The inverse function of the 
exponential function is the logarithmic function defined as follows. 


Figure 3.16 Graphs of some exponential functions. 


Definition 3.23 The function f : (0, оо) — R with 
f(x) = log, x, а> 0, а +1, 


is called a logarithmic function. 
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Y 


Figure 3.17 Graphs of some logarithmic functions. 


The graphs of the logarithmic functions with the two frequently used bases a = e anda = 10 
are given in Figure 3.17. Since the logarithmic functions are the inverse functions of the 
corresponding exponential functions, all logarithmic functions go through the point (1,0). 
Logarithmic functions with a base a 1 are strictly increasing, and logarithmic functions 
with a base a, where 0 < a < 1, are strictly decreasing. All logarithmic functions are 
unbounded. Logarithmic functions with a base a > 1 are strictly concave while logarithmic 
functions with a base a, where 0 < a < 1, are strictly convex. It is worth emphasizing that 
both logarithmic and exponential functions are only defined for positive values of the base, 
where a base equal to one is excluded. 


Example 3.26 We consider function f : Dy — R with 
y =f (x) = In(3x + 4). 


Since ће logarithmic function is defined only for positive arguments, the inequality 3x--4 > 0 
yields the domain 


4 
p, - |хевіх 31. 


The logarithmic function has the range Ry = R, and therefore function f also has the range 
Ry = R. Since the logarithmic function is bijective, function f is bijective as well, and thus 
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the inverse function f^! exists. We obtain 


y = In(3x+ 4) 
е =3x+4 

1 

= —.(е/ – 4). 
r=. (e—4) 


Exchanging now variables x and y, we have found the inverse function f^! given by 
ae (e 
“а: 


with the domain Dy = Ку = К. 


Trigonometric functions 


Trigonometric (or circular) functions are defined by means ofa circle with the origin as centre 
and radius r (see Figure 3.18). Let P be the point on the circle with the coordinates (r, 0) and 
assume that this point moves now counterclockwise round the circle. For a certain angle x, 
we get a point P on the circle with the coordinates (h, v). Then the sine and cosine functions 
are defined as follows. 


cota 


Figure 3.18 The definition of the trigonometric functions. 
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Definition 3.24 The function f : R — [1,1] with 
f(x) = sinx = Е 
7 
is called the sine function. The function f : R — [1,1] with 
h 
f(x) = cosx = – 
= 


is called the cosine function. 


The graphs of the sine and cosine functions are given in Figure 3.19 (a). The sine and cosine 
functions are bounded and periodic functions with a period of 27, i.e. we have for k € Z: 


sin(x + 2k) = sinx and cos(x + 2k) = cosx. 


The sine function is odd, while the cosine function is even. These functions are often used 
to model economic cycles. 


Remark Usually, angles are measured in degrees. Particularly in calculus, they are often 
measured in so-called radians (abbreviated rad). An angle of 360° corresponds to 2л rad, 
i.e. we get for instance 


sin 360° = sin 2л rad. 


It is worth noting that, using radians, the trigonometric functions have set R or a subset of IR 
as domain and due to this, the abbreviation rad is often skipped. 


When using powers of trigonometric functions, we also write e.g. sin? x instead of (sin x)? or 
cos? (2x) instead of (cos 2x)?. By means ofthe sine and the cosine functions, we can introduce 
two further trigonometric functions as follows. 


Definition 3.25 The function f : Dy — IR with 


f@ =tnx= ol and ру = |е] > +kr, kez} 
x 


is called the tangent function. The function f : Dy > R with 


h 
f@)=ctx=——=- and Ру = (кє |х т, k e Z) 
sin 0 


is called the cotangent function. 
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y = cot т y=tan x 


y A 


Yx 


(b) Functions y = tan x and y = cot x 


Figure 3.19 Graphs of the trigonometric functions. 


The graphs of the tangent and cotangent functions are given in Figure 3.19 (b). The tangent 
and cotangent functions are periodic functions with a period of z, i.e. we have for k € Z: 


tan(x + Кл) = tanx and cot(x + Кл) = сох. 


Both functions are unbounded and odd. 
In the following, we review some basic properties of trigonometric functions. 


Some properties of trigonometric functions 


(1) sin(x € y) = sin x cosy + cos x sin y; 
(2) cos(x + y) = cosx cosy T sin x sin y; 
tanx + tany . 


3) t ry) = — ; 
ымаа y) 1 + tanx tan y 
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cotxcoty-rl 


(4) сох +y) = 


coty + cotx ' 
P Л . x xF 
(5) sinx + siny = 2- sin cos Я 
2 
x + x— 
(6) cosx + cosy = 2- cos Z cos I 
.Xcy . X—y 
cosx — cosy = —2. sin - sin ы 
2 2 
А л л : 
(7) sin (s =) = + cosx, cos (x + =) = F sinx; 


(8) sin? х + cos? x = 1; 


(9) 1+ tan?x 2 —.—. 
COS^ X 


Properties (1) up to (4) are known as addition theorems of the trigonometric functions. For 
the special case of x — y, properties (1) and (2) turn into 


sin 2x = 2 - sin x - cosx and cos 2x = cos? x — sin? x. (3.8) 


Property (8) is also denoted as a Pythagorean theorem for trigonometric functions. As an 
illustration, we prove the identity given in Property (9) and obtain 


sin? x cos? x + sin? x 1 


1 +tan? x = 1+ „— = 5 = ——. 
COS“ x COS“ x COS“ x 


The last equality follows from property (8) above. 


Since all four trigonometric functions are not bijective mappings, we have to restrict their 
domain in order to be able to define inverse functions. If we solve equality y = sin x for x, we 
write x — arcsin y. The term arcsin y gives the angle x whose sine value is y. Similarly, we 
use the symbol ‘arc’ in front of the other trigonometric functions when solving for x. Now 
we can define the arc functions as follows. 


Definition 3.26 The function 
f:[-L1] [-5. =] with f(x) = arcsinx 
is called the arcsine function. The function 
f:[-11] — [0,7] with f(x) = arccosx 
is called the arccosine function. The function 
f : (~œ, оо) > (= >) with f(x) =агсїапх 
is called the arctangent function. The function 
Г: (оо, оо) > (0,7) with f(x) = arccot x 


is called the arccotangent function. 
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In Definition 3.26, the ranges of the arc functions give the domains of the original trigono- 
metric functions such that they represent bijective mappings and an inverse function can 
be defined. The graphs of the arc functions are given in Figure 3.20. All the arc functions 
are bounded. The arcsine and arctangent functions are strictly increasing and odd while the 
arccosine and arccotangent functions are strictly decreasing. Moreover, the arcsine and arc- 
cotangent functions are strictly concave for x < 0 and they are strictly convex for x > 0. 
The arccosine and arctangent functions are strictly convex for x < 0 and strictly concave for 
x > 0 (where in all cases, x has to belong to the domain of the corresponding function). 


у, 
T 
V, 
= arcsin x 
z Г у = arccos x 
1 
A ИС 
> Е } > 
T a 
i Ng 
X Е 
_т у = соз T 
zl 
(a) Function y — arcsin x (b) Function y = arccos = 
У ty=cot т 


y = arctan x y = arccot x 


(c) Function y = arctan = (d) Function y — arccot x 


Figure 3.20 Graphs of the inverse functions of the trigonometric functions. 


Overview on elementary functions 


In Table 3.1, we summarize the domains and ranges of the elementary functions. 
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Table 3.1 Domains and ranges of elementary functions 
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Function f with у = f(x) Dy Ry 
у=с cis constant —00 < x < оо у= с 
у= х" п= 2,3,... —oo«x«oo —oo«yc«oo 
у= х ає Е О <х<оо 0 <у<оо 
у= yx п= 2,3,... 0xx«oo 0<y<o 
у= ах а> 0 —00 < x < оо 0 <у<оо 
y = loga x a>0,a4l 0«x«oo —oo«y«oo 
y= sinx —oo«x«oo —-l<y<l 
y = cosx —oo0 < x < о0о -l<y<l 
л 
у = tanx оит -оо «y «oo 
y — cotx xzkm -оо «y «oo 
л л 
у = arcsinx -l<x<l =з == у 
y = arccos x -l<x<l О<х<л 
л 
y = arctan x —0 <x < © Wo RE 
y — arccot x —00 «x «oo О <х<л 
EXERCISES 


3.1 Let A = {1,2,3,4, 5}. Consider the relations 


R= {(1, 2), (1,4), (3, 1), (3,4), (3, 5), (5, 1), (5,4)} CA x A 


and 


S = {(aj,a2) € Ax Alay + a = 6}. 


(a) Illustrate R and S by graphs and as points in a rectangular coordinate system. 
(b) Which of the following propositions are true of relation R: 


1R2, 2R1, 


3R1, 


(c) Find the inverse relations for R and S. 
(d) Check whether the relations R and S are mappings. 


{2,3,4,5} = (a € A | 1Ra}? 
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3.2 Consider the following relations and find out whether they are mappings. Which of 
the mappings are surjective, injective, bijective? 


(a) (b) (c) (d) 


су & 
Ww кә 

л Ww A 
У 6 

Сс 8 

ON BN н 

- nN We 


© A ос а 


luc 


33 LetA = В = {1,2,3} and C = {2,3}. Consider the following mappings f : 4 — В 
апад: C — A withf(1) = 3,f (2) = 2,/ (3) = 1 and g(2) = 1,g(3) = 2. 


(a) Illustrate f, g,f o g and g of by graphs if possible. 

(b) Find the domains and the ranges of the given mappings and of the composite 
mappings. 

(c) What can you say about the properties of these mappings? 


3.4 Given is the relation 


Е = (61,32) € R? | |x2| = x1 + 2). 


Check whether F or F^! is a mapping. In the case when there is a mapping, find the 
domain and the range. Graph F and F71. 


3.5 Given are the relations 
F = {(1,x2) | x2 = х3} with x; € {—3, –2,— 1,0, 1,2,3} 
and 
G = {(x,y) € R? | 9x? + 2y? = 18}. 


Are these relations functions? If so, does the inverse function exist? 
3.6 Given are the functions f : Dy — R and g : Dg — R with 


f(x) 22x41 and g(x) 2x? — 2. 


Find and graph the composite functions g o f and f og. 
3.7 Given are the functions f : R — R+ and g : К> R with 


Хо) = е" and g(x) = —x. 


(a) Check whether the functions f and g are surjective, injective or bijective. Graph 
these functions. 

(b) Find f! and g^! and graph them. 

(c) Find f o g and g o f and graph them. 


3.8 


3.9 


3.11 


3.12 
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Find a € R such that f : Dy = [a, оо) > R with 


у= (х) =x? +2x-3 


being a bijective function. Find and graph function f~!. 
Find domain, range and the inverse function for function f : Dy > R with y = f (x): 


= МХ, b) у= (x-2)y. 
© у= Xi © у= 6-2) 
Given аге the polynomials P5 : R > R and P? : R > К with 


Р(х) = 2х5 — 6x* — бх? + 22x? — 12x and P(x) = (x — 1)2. 


(a) Calculate the quotient P5/P» by polynomial division. 
(b) Find all the zeroes of polynomial Ps and factorize P5. 
(c) Verify Vieta's formulae given in Theorem 3.6. 

(d) Draw the graph of the function P5. 


Check by means of Horner's scheme whether x; = 1,x2 = —1,x3 = 2,x4 = —2 are 
zeroes of the polynomial Pg : К — R with 


Pex) = xf 42 — xt — xX? 4 2? — x — 2. 


Factorize polynomial P6. 
Find the domain, range and the inverse function for each of the following functions 
Ji: Dg > R with y; = fi(x): 
(a) у =sinx, yo =2sinx, y3=sin2x, ул = ѕіпх +2 and 
ys = sin(x + 2); 
(D yp =e*, у= 2е*, уз=е”, уу=е*+2 and ys =e*t?. 
Graph the functions given in (a) and (b) and check whether they are odd or even or 
whether they have none of these properties. 
Given are the following functions f : Dy > R with y = f (x): 


(а) у= ах; (Ы) у= lnx; (с) у= 3х2 + 5; 
(d у= У4— х?; (е) y=lt+e*; (f у= Мх. 
Find the domain and range for each of the above functions and graph these functions. 


Check where the functions are increasing and whether they are bounded. Which of 
the functions are odd or even? 


4 Differentiation 


In economics, there are many problems which require us to take into account how a function 
value changes with respect to small changes of the independent variable (e.g. input, time, etc.). 
For example, assume that the price of some product changes slightly. The question is how does 
this affect the amount of product customers will buy? A useful tool for such investigations 
is differential calculus, which we treat in this chapter. It is an important field of mathematics 
with many applications, e.g. graphing functions, determination of extreme points of functions 
with or without additional constraints. Differential calculus allows us to investigate specific 
properties of functions such as monotonicity or convexity. For instance in economics, cost, 
revenue, profit, demand, production or utility functions have to be investigated with respect 
to their properties. In this chapter, we consider functions f : Dy — IR depending on one real 
variable, i.e. Dy € R. 


4.4 LIMIT AND CONTINUITY 


411 Limit of a function 


One of the basic concepts in mathematics is that of a limit (see Definition 2.6 for a sequence). 
In this section, the limit of a function is introduced. This notion deals with the question of 
which value does the dependent variable y of a function f with y = f(x) approach as the 
independent variable x approaches some specific value хо? 


Definition 4.1 The real number Z is called the limit of function f : D; > R as x 
tends to xo if for any sequence {xn} with x, 4 xo, x» € Dy, n = 1,2,..., which 
converges to xo, the sequence of the function values {f (x,)) converges to L. 


Thus, we say that function f tends to number L as x tends to (but is not equal to) xo. As an 
abbreviation we write 


lim f(x) = L. 
х-х0 


Note that limit L must be a (finite) number, otherwise we say that the limit of function f as x 
tends to xo does not exist. If this limit does not exist, we distinguish two cases. If L = -Eco, 
we also say that function f is definitely divergent as x tends to xo, otherwise function f is 
said to be indefinitely divergent as x tends to xo. 
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In the above definition, the elements of sequence {xn} can be both greater and smaller than хо. 
In certain situations, only the limit from one side has to be considered. In the following, we 
consider such one-sided approaches, where the terms of the sequences are either all greater 
or all smaller than xo. 


Definition 4.2 The real number L, (real number Гл) is called the right-side (left-side) 
limit of function f : Dy — R as x tends to xo from the right side (left side) if for any 
sequence {xn} with x, > xo (x, < xo), Xn € Dy, n = 1,2,..., the sequence of the 
function values {f (xn)} converges to Lr; (converges to L1). 


We also write 


lim fx) = L; and lim К Хох) = Ц 


x—xo4-0 X—X0— 


for the right-side and left-side limits, respectively. A relationship between one-sided limits 
and the limit as introduced in Definition 4.1 is given by the following theorem. 


THEOREM 4.1 The limit of a function f : Dy — R as x tends to x exists if and only if 
both the right-side and left-side limits exist and coincide, i.e. 


lim f(x) = lim IO = lim f (x). 
X—xo— x—XQ 


x—x94-0 


We note that it is not necessary for the existence of a limit of function f as x tends to хо that 
the function value f (xo) at point xo be defined. 


Example 4.1 Let function f : R \ {0} —> R with 


fo. 4 
x 


be given. We want to compute 
L= lim f(x). 
x0 
However, since 
: A x E ч =x 
lim f(x)= lim -= 1 and lim f(x)—- lm — —-1, 
—04-0 x—04-0 x x— 0—0 x>0-0 x 


the limit of function f as x tends to zero does not exist. 


Example 4.2 Let function f : R \ {0} —> R with 


f(x) = sin 1 
x 
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be given. If we want to compute 
L= lim f (x), 
x0 
we find that even both one-sided limits 


І. = lim f(x) and L= lim f(x) 
x—04-0 x—0—0 
do not exist since, as x tends to zero either from the left or from the right, the function values 
of the sine function oscillate in the interval [—1, 1], i.e. the function values change very 
quickly between the numbers —1 and 1 (even with increasing frequency as x tends to zero). 
Thus, the limit of function f as x tends to zero does not exist and function f is indefinitely 
divergent as x tends to zero. 


Next, we give some useful properties of limits. 
THEOREM 4.2 Assume that the limits 

Jim iG) — Li ай — lim fax) = Lo 
exist. Then the following limits exist, and we obtain: 


() lim [e «e| = Jim fie) + lim fo) = Li + La; 


Q) lim [A60 — 560] = tim Л) — lim 60 = Ly — La; 

G) lim [A60 50] = fim iO lim о) = Ly - La; 
"E A | | 

(4) lim яа = Т 0) h provided that L2 Æ 0; 

(5) In VA) = Hm f (x) = vL provided that Г. > 0; 

шше EZ о] = 1; 


im | — айл 


(7 lim [a] = al 


x—> xo 


Example 4.3 Given the function f : Dy — R with 


2 2 
EM алаа 
x 


we compute the limit 


L= lim ЈО), 
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Applying Theorem 4.2, we obtain 


i "EXER Шыу + Slims 41% —1 et rom 
x1 x ту] х 1 


Example 4.4 Given the function f : Dy — К with 


мх — 


x 
x—4' 


fx) = 
we compute the limit 
L= lim f(x). 
x4 


If we apply Theorem 4.2, part (4), and separately determine the limit of the numerator and the 
denominator, we find that both terms tend to zero, and we cannot find the limit in this way. 
Therefore, we rationalize the numerator by multiplying the numerator and the denominator 
by ./x + 2 and obtain: 


psg аа а ngu с кк ИК ИБР 
a4 (х—4)(/х+2) “2546-46040 254.442 /4+2 4 


4.1.2 Continuity of a function 


Definition 4.3 A function f : Dy — R is said to be continuous at xo € Dy if the limit 
of function f as x tends to xo exists and if this limit coincides with the function value 


f (xo), i.e. 
‚диш (x) = f (xo). 


Alternatively, we can give the following equivalent definition of a continuous function at ху 
using the (6 — &) notation. 


Definition 4.3* А functionf : Dy — R is said to be continuous at xo € Dy if, for any 
real number є > 0, there exists a real number ё (=) such that inequality |x — xo| < (=) 
implies inequality |f (x) — f (xo)| < =. 


We illustrate the latter definition in Figure 4.1. Part (a) represents a continuous function 
at хо. This means that for any = > 0 (in particular, for any arbitrarily small positive в), we 
can state a number ô depending on = such that for all x from the open interval (xo — ô, xo + ô) 
the function values f (x) are within the open interval (f(x) — &,f (x) + €) (ve. in the dashed 
area). In other words, continuity of a function at some point x9 € Dy means that small 
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210—5 To  rgdó 


(a) Function f is continuous at xo 


X9—ó To  xo4ó 


(b) Function f is discontinuous at zo 


Figure 4.1 Continuous and discontinuous functions. 


changes in the independent variable x lead to small changes in the dependent variable y. 
Figure 4.1(b) represents a function which is not continuous at хо. For some (small) = > 0, 
we cannot give a value ô such that for all x € (xo — ô, хо + ô) the function values are in the 
dashed area. 


If the one-sided limits of a function f as x tends to xo are different (or one or both limits do 
not exist), or if they are identical but the function value f (xo) is not defined or value f (xo) is 
defined but not equal to both one-sided limits, then function f is discontinuous at хо. Next, 
we classify some types of discontinuities. 


If the limit of function f as x tends to xo exists but the function value f (xo) is different or 
function f is even not defined at point xo, we have a removable discontinuity. In the case 
when function f is not defined at point xo, we also say that function f has a gap at xo. 


A function f has a finite jump at хо if both one-sided limits of function f as x tends to хо — 0 
and x tends to xo 4- 0 exist and they are different. The following discontinuities characterize 
situations when at least one of the one-sided limits of function f as x tends to хо + 0 does 
not exist. If one of the one-sided limits of function f as x tends to хо + 0 exists, but from the 
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other side function f tends to оо or —co, then we say that function f has an infinite jump at 
point хо. A rational function f = P/Q has a pole at point хо if Q(xo) = 0 but P(xo) Æ 0. 
(As a consequence, the function values of f as x tends to хо — 0 or to хо + 0 tend either 
to oo or —oo, i.e. function f is definitely divergent as x tends to xo.) The multiplicity of the 
zero xo of polynomial Q defines the order of the pole. In the case of a pole of even order, 
the sign of function f does not change ‘at’ point хо while in the case of a pole of odd order 
the sign of function f changes ‘at’ point xo. Finally, function f has an oscillation point ‘at’ 
xo if function f is indefinitely divergent as x tends to хо (i.e. neither the limit of function f 
as x tends to xo exist nor function f tends to +00 as x tends to xo). In the above cases of a 
(finite or infinite) jump, a pole and an oscillation point, we also say that function f has an 
irremovable discontinuity at point xo. 


For illustration, we consider the following examples. 


Example 4.5 Let function f : Dy — R with 


ТЕСЕ 
КОШ ыа. 
х—1 


be given. In this case, we have хо = 1 ¢ Dy, but 
lim f(x) = 3 
x—> xo 


(see Figure 4.2). Thus, function f has a gap at point xo = 1. This is a removable discontinuity, 
since we can define a function f* : IR > R with 


seo | Ра) forx Al 
rel 3 forx = 1 


which is continuous at point хо = 1. 


] 
] 
] 
] 
] 
] 
] 
] 
] 
] 
] 
] 
] 
| 
1 


Figure 4.2 Function f with f (x) = (x2 +x- 2)/(х — 1). 
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Example 4.6 We consider function f : Dy — R with 


f (x) = sin E 
x 


For хо = 0, we get xo ¢ Dy, and we have already shown in Example 4.2 that function f is 
indefinitely divergent as x tends to zero. Therefore, point xo = 0 is an oscillation point. 


Analogously to Definition 4.3, we can introduce one-sided continuity. Ifthe limit of function f 
as x tends to xo + 0 exists and is equal to f (xo), then function f is called right-continuous 
at point хо. In the same way, we can define the left-continuity of a function at a point хо. 
Function f is called continuous on the open interval (a, b) € Dy ifit is continuous at all points 
of (a, b). Analogously, function f is called continuous on the closed interval [a,b] С Dy if 
it is continuous on the open interval (a, b), right-continuous at point a and left-continuous at 
point b. If function f is continuous at all points x of the domain Dy, then we also say that f 
is continuous. 


Properties of continuous functions 


First, we give some properties of functions that are continuous at particular points. 


THEOREM 4.3 Let functions f : Dy — R and g : Dg — К be continuous at point 
xo € Dy N Dg. Then functions f + 9,7 — 9,7 - g and, for g(xo) 4 0, also function f /g are 
continuous at xo. 


THEOREM 4.4 Let function f : Dy — R be continuous at xo € Dy and function g : Dg > 
К be continuous at point x; = f (xo) € Dg. Then the composite function g o f is continuous 
at point xo, and we get 


jim gf 6) = g (tims о) = g(f xo). 


The latter theorem implies that we can ‘interchange’ the determination ofthe limit of function 
f as x tends to xo and the calculation of the value of function g. We continue with three 
properties of functions that are continuous on the closed interval [a, 5]. 


THEOREM 4.5 Let function f : Dy > R be continuous on the closed interval [а,Ь] С Dy. 
Then function f is bounded on [a, b] and takes its minimal value fmin and its maximal value 
fmax at points Хип and Xmax, respectively, belonging to the interval [a, b], i.e. 


Smin =f Xmin) € f Œ) € (тах) = fmax for all x € [а,Ь]. 


Theorem 4.5 does not necessarily hold for an open or half-open interval, e.g. function f with 
f (x) = 1/x is not bounded on the left-open interval (0, 1]. 


THEOREM 4.6 (Bolzano's theorem) Let functionf : Dy — К be continuous on the closed 
interval [a,b] € Dy with f (a)-f (b) < 0. Then there exists an x* € (a, b) such that f (x*) = 0. 
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Theorem 4.6 has some importance for finding zeroes of a function numerically (see later in 
Chapter 4.8). In order to apply a numerical procedure, one needs an interval [a, b] (preferably 
as small as possible) so that a zero of the function is certainly contained in this interval. 


THEOREM 4.7 (intermediate-value theorem) Let function f : Dy — R be continuous on 
the closed interval [a,b] С Dy. Moreover, let fmin be the smallest and fmax the largest value 
of function f for x € [a,b]. Then for each y* € [fmin, fmax], there exists an x* € [a,b] such 
that f (x*) = y*. 


The geometrical meaning of Theorem 4.7 is illustrated in Figure 4.3. The graph of any line 
y = у“ € [fmin;fmax] intersects at least once the graph of function f . For the particular value 
у“ chosen in Figure 4.3, there are two such values x} and x5 with f (x1) = f (x5) = у". 


fmin4 


Figure 4.3 Illustration of Theorem 4.7. 


4.0 DIFFERENCE QUOTIENT AND THE DERIVATIVE 


We now consider changes in the function value y = f(x) in relation to changes in the 
independent variable x. If we change the value x of the independent variable by some value 
Ax, the function value may also change by some difference Ay, i.e. we have 


Ay = f (x + Ax) — f(x). 


We now consider the ratio of the changes ^y and Ax and give the following definition. 


Definition 4.4 Let f : Dy > R and xo, xo + Ax € (a, b) € Dy. The ratio 


Ay _ f@o + Ax) — f (xo) 
Ax — Ax 


is called the difference quotient of function f with respect to points хо + Ax and xo. 
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The quotient Ay/ Ax depends on the difference Ax and describes the average change of the 
value of function f between the points хо and xo + Ax. Let us now consider what happens 
when Ax — 0, i.e. the difference in the x values of the two points хо + Ax and хо becomes 
arbitrarily small. 


Definition 4.5 Let f : Dy — R. Then function f with y = f(x) is said to be 
differentiable at point xg € (a, b) € Dy if the limit 


li f (xo + Ax) — f (xo) 
im 
Ax—0 Ax 


exists. The above limit is denoted as 


df (xo) _ 
de 5 


f! (xo) 


and is called the differential quotient or derivative of function f at point хо. 


We only mention that one can define one-sided derivatives in an analogous manner, e.g. the 
left derivative of function f at some point xy € Dy can be defined by considering only 
the left-side limit in Definition 4.5. If function f is differentiable at each point x of the open 
interval (а, b) € Dy, function f is said to be differentiable on the interval (а, Р). Analogously, 
if function f is differentiable at each point x € Dy, function f is said to be differentiable. 


If for any x € Dy the derivative f'(x) exists, we obtain a function f" with y = f'(x) by 
assigning to each x € Dy the value f" (x). We also say that function f" is the first derivative 
of function f . If function f" is continuous, we say that the original function f is continuously 
differentiable. 


In economics, function f” is also referred to as the marginal of ог the marginal function. It 
reflects the fact that the derivative characterizes the change in the function value provided that 
the change in the variable x is sufficiently small, i.e. it can be considered as ‘marginal’. This 
means that the marginal function can be interpreted as the approximate change in function f 
when variable x increases by one unit from xo to xo + 1, Le. 


f (xo) © f (xo + 1) — f (xo). 


A geometric interpretation of the first derivative is as follows. The value f" (xo) is the slope 
of the tangent to the curve y = f (x) at the point (xo, f (xo)) (see Figure 4.4). Consider the line 
through the points (xo, f (xo)) and (хо + Ax, f (xo + Ax)). For the slope of this line we have 
Ay 
—- — tan p. 
Ax P 
If Ax becomes smaller and tends to zero, the angle of the corresponding line and the x axis 
tends to о, and we finally obtain 


A 
fo) = a 
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Line through points 


4— (wo, f(xo)) and 
(to + Az, f (xo4- Ax)) 


157 


Ay 
ШЕШ 
Tangent to f(x) at 
(xo, f (zo)) 
x 
Figure 4.4 Geometrical interpretation of the derivative f’ (xo). 
Example 4.7 Let function f : R > R with 
2x fo x«l 
›=/® = | 3-х for 1<х<2 
be given and let xy = 1. We obtain 
Гоо + Ax) — f (xo) ..— 2(xo + Ax) — 2х0 2. 2Ах 
lim = lim = lm —=2 
Ax— 0-0 Ax Ax— 0-0 Ax Ax—0—0 Ax 
and 
_ f Go + Ax) — f (xo) 20003 — (xo + Ax) — 3 — xo) 
lim = lim 
Ax—0+0 Ax Ax—0+0 Ax 
Ў = Ах 
= lim = — 
Ах>0+0 Ax 


Consequently, the differential quotient of function f at point хо = 1 does not exist and 
function f is not differentiable at point хо = 1. However, since both one-sided limits of 
function f as x tends to 1 + 0 exist and are equal to f(1) = 2, function f is continuous at 


point xo = 1. 


The latter example shows that a function being continuous at point xo is not necessarily 
differentiable at this point. However, the converse is true. 


THEOREM 4.8 Let function f : Dy — R be differentiable at point xọ € Dy. Then function f 


is continuous at xo. 


158 Differentiation 
4.5 DERIVATIVES OF ELEMENTARY FUNCTIONS; DIFFERENTIATION 
RULES 


Before giving derivatives of elementary functions, we consider an example for applying 
Definition 4.5. 


Example 4.8 We consider function f : R — R with 
Јо) = 
and determine the derivative f” at point x: 


ы — 
eee хх ш ЖО с, 
Ах—>0 Ах Ax Ee 


x3 + 3x? Ax + 3x (Ax? + (Ах)? — x3 
Ax—0 Ax 

Ах. [3x2 + 3x Ax + (Ax)?] 
Ax—0 Ax 


= 3x”. 


The above formula for the derivative of function f with f(x) = x? can be generalized to the 
case of a power function f with f(x) = х", for which we obtain f'(x) = пх” !. 


The determination of the derivative according to Definition 4.5 appears to be unpractical for 
frequent use in the case of more complicated functions. Therefore, we are interested in an 
overview on derivatives of elementary functions which we can use when investigating more 
complicated functions later. Table 4.1 contains the derivatives of some elementary functions 
of one variable. 


Table 4.1 Derivatives of elementary functions 


»-fG) y-f'G) Dy 


C 0 —oo«x«oo, Cisconstant 
xt nx"-l —oo«x«oo neN 
x ax?! 0cx«oo, acm 
e* e* —oo«x«oo 
a* a*]na —oo«x«oo a»0 
1 
Inx = 0 <x <œ 
x 
1 
log, x 0«x«oo, а> 0 
xlna 
sinx cosx —oo«x«oo 
cos x — sinx —00 <x < oo 
1 л 
tanx 3 x#—+kn, Кє? 
cos“ x 2 
1 
cot x T) x z km, keZ 
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Sum, product and quotient rules 


Next, we present formulas for the derivative of the sum, difference, product and quotient of 
two functions. 


THEOREM 4.9 Letfunctionsf : Dy — Randg : Dg — R be differentiable atx є De Dg. 
Then the functions f + g, – g, f д and, for g(x) Æ 0, also function f /g are differentiable 
at point x, and we have: 


(D) +9) 0) = f'GO + 9’); 

(2) (f — 4) G) =f" (x) — g E); 

(3) (7-9) G0 —f'G)- g(x) +f D - д' О); 
(£) Ро) яб) — о) : g'G) 

(4) (x) = 


g [4 (х)]? 


As a special case of part (3) we obtain: if f(x) = C, then (C - g) (x) = С. g'(x), where C is 
a constant. 


Example 4.9 Let function f : Dy — R with 
f) = 4x^ — 2x + Inx + /x 


be given. Using ./x = x!/?, we obtain 


(OMe =й есы 
Е х 24 


Example 4.10 In macroeconomics, it is assumed that for a closed economy 
Ү= С+1, 


where Y is the national income, С is the consumption and J is the investment. Assume that 
the consumption linearly depends on the national income, i.e. equation 


C=a+byY 


holds, where a and b are parameters. Here C'(Y) = b is called the marginal propensity to 
consume, a parameter which is typically assumed to be between zero and one. If we want to 
give the national income as a function depending on the investment, we obtain from 


Y=a+bY+I 


the function Y as 


acl 


¥=YQ)=7—. 
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For the derivative Y’ (J) we obtain 


; dY 1 
YD) = НЫ 
1 1—b 


The latter result can be interpreted such that an increase in J by one unit leads to an increase 
of Y by 1/(1 — b) > 0 units. 


Example 4.11 Let functions f : К > R and g : R —> R with 
ғо) = х2 апа g(x) = e 

be given. Then 
F- g)' (x) = 2xe* + x!e* = xe'(x + 2) 


E» 2хех —x!& | xe*(2—x) х(2- х) 
(2) = = S | 


(ех)2 = (ex? e* 


Derivative of composite and inverse functions 
Next, we consider composite and inverse functions and give a rule to determine their 
derivatives. 


THEOREM 4.10 Let functions f : Df > R and g : Dj — R be continuous. Then: 


(1) If function f is differentiable at point x є Dy and function g is differentiable at point 
у= f (x) € Dg, then function g o f is also differentiable at point x and 


(9 of) (х) = (907) (х) = g'(Q G2) - f£ G2 (chain rule). 


(2) Let function f be strictly monotone on Dy and differentiable at point x € Dy with 
f'(x) # 0. Then the inverse function f^! with x = f^! (y) is differentiable at point 
y — f(x) € Dy-1 and 


1 1 


-lv — = К 
V T0 — OF 19) 


An alternative formulation of the chain rule with ^ = g(y) and y = f (x) is given by 


dh а dy 


dx dy dx 


The rule given in part (2) of Theorem 4.10 for the derivative of the inverse function f^! can 
also be written as 


y 1 
(f ) fof 


Differentiation 161 


Example 4.12 Suppose that a firm produces only one product. The production cost, denoted 
by C, depends on the quantity x > 0 of this product. Let 


C — f(x) = 4-- In(x + 1) + 3x +1. 


Then we obtain 


df(x) _ 1 ik 3 
dv x+1 2,/3х+1 


Assume that the firm produces xo — 133 units of this product. We get 


С = (к) = 


1 3 1 3 
C'(133) = = = 0.08246. 
(99) 134 E: 24/400 134 i: 40 


So, if the current production is increased by one unit, the production cost increases 
approximately by 0.08246 units. 


Example 4.13 Let function g : Dj — R with 
g(x) = cos [In(3x + DD? 

be given which is defined for x > —1/3, i.e. Dj = (—1/3, оо). Setting 
g = cosv, v=h’, h= һу and у = (3х + 1), 


application of the chain rule yields 


‚ dg dv dh dy 
g œ) = т A EET TÉ , 


and thus we get 


g'(x) = —sin[In(3x + DP -21n(3x + D) - 3 


3x41. 


6 
zi cum sin [In Gx + DP - In(3x + 1). 


By means of Theorem 4.10, we can also determine the derivatives of the inverse functions 
of the trigonometric functions. Consider the function f with 


. лл 
y =f (x) = 50 x, хє es =) > 


and the inverse function 


x = f7! (y) = arcsin y. 
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Since f" (x) = cos x, we obtain from Theorem 4.10, part (2): 
1 1 


f'G) cosx 


Because cosx = +v cos? x = y 1 — sin? x = \/1 — y? for x € (-л/2, 7/2), we get 
5 


Л!» = 


1 
уо) = =, 
1—у 


and after interchanging variables x апа у 


TYE) = 


1 
V1 — x2 


Similarly, we can determine the derivatives of the inverse functions of the other trigono- 
metric functions. We summarize the derivatives of the inverse functions of the trigonometric 
functions in the following overview: 


1 
f(x) =aresinx, (х) = , Dre={xeR|—-l <x < 1}; 
М1 — x2 
1 
f(x) = агссоѕх, f’(x)=— , Dr={xEeR|—-1 <x < l}; 
1 — x? ^ 
f(x) = агсќапх, (х) = Dra Df = {x € R | ~œ < x < œ}; 
x 
f(x) =arccotx, f’(x) = -——, Dy = {x € R| оо < x < оо). 
1+ x? f 


Logarithmic differentiation 


As an application of the chain rule, we consider so-called logarithmic differentiation. Let 


g(x) = Inf (x) 


We obtain 
g' (x) = [Inf G)T 


and thus 


with f(x) > 0. 


Ро) 


f(x) 


Го) = 0) 9 w) =f) - Inf G)T. 


Logarithmic differentiation is particularly useful when considering functions f of type 


Ро) = н)", 


i.e. both the basis and the exponent are functions depending on variable х. 
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Example 4.14 Let function f : Dy — R with 


feo = xSinx 


be given. We set 
g(x) = Inf (x) = In (| = sinx - lnx. 

Applying the formula of logarithmic differentiation, we get 
g'(x) = cosx · Inx + sin x- : 


and consequently 


Ро) =f) g' (x) = х c -Inx+sinx- 3 : 


Higher-order derivatives 


In the following we deal with higher-order derivatives. If function f" with y' = f' (x) is again 
differentiable (see Definition 4.5), function 


df'v) ау() 
dx (ах)? 


y" = f" (x) Z 


is called the second derivative of function f at point x. We can continue with this procedure 
and obtain in general: 


df" c) а(х) 

у" =f" (х) 3 dopo = 

which denotes the nth derivative of function f at point x € Dy. Notice that we use 
РО), f(x), f(x), and for n > 4, we use the notation f(x). Higher-order deriva- 
tives are used for instance in the next section when investigating specific properties of 
functions. 


Example 4.15 Let function f : Dy > R with 


1 
До) = 3x + -H e” 
x 
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be given. We determine all derivatives until fourth order and obtain 
/ 1 2х 
f (x) = 6х – — + 2е 
X 
Hn 2 2x 
f œ) =6+ as 4e 
x 
/// 6 2x 
f о) = S ET 8e 
x 
24 


FOR) == 16e. 
x 


If the derivatives f" (хо), (xo), ...,f (xo) all exist, we say that function f is n times 
differentiable at point xo є Dy. If f n) is continuous at point xo, then function f is said to 
be n times continuously differentiable at хо. Similarly, if function f is continuous, then 
function f is said to be n times continuously differentiable. 


44 DIFFERENTIAL; RATE OF CHANGE AND ELASTICITY 


In this section, we discuss several possibilities for characterizing the resulting change in 
the dependent variable y of a function when considering small changes in the independent 
variable x. 


Definition 4.6 Let function f : Dy — К be differentiable at point хо € (a, b) С Dy. 
The differential of function f at point xo is defined as 


dy = f' (xo) - dx. 


The differential is also denoted as df. Note that dy (or df) is proportional to dx, with 
f'(xo) as the factor of proportionality. The differential gives the approximate change in 
the function value at хо when changing the argument by (a small value) dx (i.e. from хо to 
хо + dx), i.e. 


Ay © dy = f' (xo) - dx. 
The differential is illustrated in Figure 4.5. 


The differential can be used for estimating the maximal absolute error in the function value 
when the independent variable is only known with some error. Let | Ax| = |dx| < A, then 


|Ayl © |dy| = (f£ (хо) - Idx] < If (о) - А. 
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Figure 4.5 The differential dy. 


Example 4.16 Let the length of an edge of a cube be determined as х = 7.5 + 0.01 cm, 
i.e. the value x9 = 7.5 cm has been measured, and the absolute error is no greater than 0.01, 
ie. |dx| = |Ax| < A = 0.01. For the surface of the cube we obtain 


So = S(xo) = 6х2 = 337.5 cm?. 
We use the differential to estimate the maximal absolute error of the surface and obtain 
|AS| ~ |dS| < |S'(xo)| - A = |12xo| - 0.01 = 0.9 cm’, 


i.e. the maximal absolute error in the surface is estimated by 0.9 cm?, and we get 


So © 337.5 + 0.9 cm?. 
Moreover, for the estimation of the maximal relative error we obtain 


dS 
So 


0.9 
< 
3375 


AS 
| ғ 0.00267, 


So 


A^ | 


i.e. the maximal relative error is estimated by approximately 0.267 per cent. 


We have already discussed that the derivative of a function characterizes the change in the 
function value for a *very small' change in the variable x. However, in economics often a 
modified measure is used to describe the change of the value of a function f. The reason for 
this is that a change in the price of, e.g. bread by 1 EUR would be very big in comparison 
with the current price, whereas the change in the price ofa car by 1 EUR would be very small. 
Therefore, economists prefer measures for characterizing the change of a function value in 
relation to this value itself. This leads to the introduction of the proportional rate of change 
of a function given in the following definition. 
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Definition 4.7 Let function f : Dy — R be differentiable at point xo є (a,b) € Dy 
and f (xo) Æ 0. The term 


f' (xo) 
f (xo) 


is called the proportional rate of change of function f : Dy — IR at point xo. 


pf (xo) = 


Proportional rates of change are often quoted in percentages or, when time is the independent 
variable, as percentages per year. This percentage rate of change is obtained by multiplying 
the value ру (xo) by 100 per cent. 


Example 4.17 Let function f : Dy — IR with 
f(x) = xe" 
be given. The first derivative of function f is given by 
f Gy = xe? + ge = xe? 0.10) 
and thus, the proportional rate of change at point x € Dy is calculated as follows: 


f'(x)  xe"*(2--0.x) 2 
pp) То) 520015 E 
We compare the proportional rates of change at points хо = 20 and xı = 2,000. For xo = 20, 
we get 


2 
20) = — + 0.1 = 0.2 
pr (20) 20 t 


which means that the percentage rate of change is 20 per cent. For xı = 2,000, we get 


2 
2.000) = —_— + 0.1 = 0.101 
pf (2, 000) 2.000 +0 | 


i.e. the percentage rate of change is 10.1 per cent. Thus, the second percentage rate of change 
is much smaller than the first one. 


Definition 4.8 Let function f : Dy > R be differentiable at point xo € (a,b) € Dy 
and f (xo) Æ 0. The term 


xo : f' (xo) 
f (xo) 


is called the (point) elasticity of function f : Dy — R at xo. 


ef (x0) = = xo · pr (Xo) 
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In many economic applications, function f represents a demand function and variable x 
represents price or income. The (point) elasticity of function f corresponds to the ratio of the 
relative (i.e. percentage) changes of the function values of f and variable x: 


ef (xo) = lim (= . v) — lim BIO), ~) 
ESE Ax>0 (xo) Ax ^ Axe f (xo) | хо | 


An economic function f is called elastic at point xo € Dy if |e¢(xo)| > 1. On the other hand, 
it is called inelastic at point xo if |er(xo)| < 1. In the case of |er(xo)| = 1, function f is of 
unit elasticity at point xo. 


In economics, revenue R is considered as a function of the selling price p by 
R(p) =p: D(p), 


where D is the demand function that is usually decreasing, i.e. if the price p rises, the quantity 
D sold falls. However, the revenue (as the product of price p and quantity D) may rise or fall. 
For the marginal revenue function, we obtain 


R'(p) — p: D'(p) + Dp). 
Assume now that R'(p) > 0, i.e. from р. D'(p) + D(p) > 0 we get for D(p) > 0 


p:D'p — 
D(p) | 


where due to D'(p) < 0, we have |ep(p)| « 1. Thus, if revenue increases when the price 
increases, it must be at an inelastic interval of the demand function D = D(p), and in 
the inelastic case, a small increase in the price will always lead to an increase in revenue. 
Accordingly, in the elastic case, i.e. if |ep(p)| > 1, a small increase in price leads to a 
decrease in revenue. So an elastic demand function is one for which the quantity demanded 
is very responsive to price, which can be interpreted as follows. If the price increases by one 
per cent, the quantity demanded decreases by more than one per cent. 


ep(p) = 


Example 4.18 We consider function f : (0, оо) — К given by 
toma 


and determine the elasticity of function f at point x € Dy. First, we calculate the first 
derivative f" by applying logarithmic differentiation. We set 


g(x) = Inf (x) = (x + 1) Inx. 
Therefore, 


LO х+1 
yu) iem = 


Thus, we obtain for the first derivative 


9 (х) = ғо) = 


отоо. (ins Ht - | 


x 
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For the elasticity at point x € Dy, we obtain 


xvf x) x. x**l (их + =!) 
fe. qu 


ef (x) = =xInx+x+1. 


4.5 GRAPHING FUNCTIONS 


To get a quantitative overview on a function f : Dy — К resp. on its graph, we determine 
and investigate: 


(1) domain Dy (if not given) 

(2) zeroes and discontinuities 

(3) monotonicity of the function 

(4) extreme points and values 

(5) convexity and concavity of the function 

(6) inflection points 

(7) limits, i.e. how does the function behave when x tends to oo. 


Having the detailed information listed above, we can draw the graph of function f. In the 
following, we discuss the above subproblems in detail. 


In connection with functions of one variable, we have already discussed how to determine 
the domain Dy and we have classified the different types of discontinuities. As far as the 
determination of zeroes is concerned, we have already considered special cases such as 
Zeroes of a quadratic function. For more complicated functions, where finding the zeroes 
is difficult or analytically impossible, we give numerical procedures for the approximate 
determination of zeroes later in this chapter. We start with investigating the monotonicity of 
a function. 


4.5.1 Monotonicity 


By means of the first derivative f", we can determine intervals in which a function f is 
(strictly) increasing or decreasing. In particular, the following theorem can be formulated. 


THEOREM 4.11 Let function f : Dy — R be differentiable on the open interval (a, b) and 
let J = [a,b] € Dy. Then: 


(1) Function f is increasing on / if and only if f" (x) > 0 for all x € (a, b). 
(2) Function f is decreasing on J if and only if f'(x) < 0 for all x € (a, b). 
(3) Function f is constant оп J if and only if f'(x) = 0 for all x € (a, b). 
(4) I£f" (x) > 0 for all x є (a, b), then function f is strictly increasing on J. 
(5) Iff’(x) < 0 for all x € (a, b), then function f is strictly decreasing on J. 


We recall from Chapter 3 that, if a function is (strictly) increasing or (strictly) decreasing on 
an interval I C Dy, we say that function f is (strictly) monotone on the interval 7. Checking a 
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function f for monotonicity requires us to determine the intervals on which function f is 
monotone and strictly monotone, respectively. 


Example 4.19 We investigate function f : IR > R with 
f(x) = е2 +2x + 1) 
for monotonicity. Differentiating function f, we obtain 
ро) = (x? 4 2x +1) + e'Qx + 2) = e (x? + 4х + 3). 
To find the zeroes of function f’, we have to solve the quadratic equation 
х +45 +3= 0 
(notice that е” is positive for x € IR) and obtain 
xy =—24+V4—-3=-1 and x2 = —2 V4—3 = —3. 
Since we have distinct zeroes (i.e. the sign of the first derivative changes ‘at’ each zero) and 
e.g. f'(0) = 3 > 0, we get that f'(x) > 0 for x є (—00, —3) U (—1, оо) and f'(x) < 0 for 


x € (—3, —1). By Theorem 4.11 we get that function f is strictly increasing on the intervals 
(—оо, —3] and [—1, oo) while function f is strictly decreasing on the interval [—3, —1]. 


4.5.2 Extreme points 


First we give the definition of a local and of a global extreme point which can be either a 
minimum or a maximum. 


Definition 4.9 А function f : Dy > К has a local maximum (minimum) at point 
xo € Dy if there is an interval (a, b) € Dy containing xo such that 


f (x) < f (xo) (f (x) = f (xo), respectively) (4.1) 


for all points x € (a,b). Point xo is called a local maximum (minimum) point. If 
inequality (4.1) holds for all points x € Dy, function f has at point xo a global maximum 
(minimum), and ху is called a global maximum (minimum) point. 


These notions are illustrated in Figure 4.6. Let Dy — [a, b]. In the domain Dy, there are two 
local minimum points x? and x4 as well as two local maximum points x; and x3. The global 
maximum point is x3 and the global minimum point is the left boundary point a. We now 
look for necessary and sufficient conditions for the existence of a local extreme point in the 
case of differentiable functions. 


THEOREM 4.12 (necessary condition for local optimality) Let function f : Dy — IR be 
differentiable on the open interval (a, b) С Dy. If function f has a local maximum or local 
minimum at point хо € (a, Б), then f' (xo) = 0. 
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A point xo € (a, b) with f’ (xo) = 0 is called a stationary point (or critical point). In searching 
global maximum and minimum points for a function f in a closed interval J = [a,b] C Dy, 
we have to search among the following types of points: 


(1) points in the open interval (a, b), where f'(x) = 0 (stationary points); 
(2) end points a and b of J; 
(3) points in (a, b), where f'(x) does not exist. 


Points in / according to (1) can be found by means of differential calculus. Points according 
to (2) and (3) have to be checked separately. Returning to Figure 4.6, there are three stationary 
points x2, x3 and x4. The local maximum point x; cannot be found by differential calculus 
since the function drawn in Figure 4.6 is not differentiable at point x1. 


a v, — vs zaba 
Figure 4.6 Local and global optima of function f on Dy = [а, 6]. 


The following two theorems present sufficient conditions for so-called isolated local extreme 
points, for which in inequality (4.1) in Definition 4.9 the strict inequality holds for all x € 
(a, b) different from хо. First, we give a criterion for deciding whether a stationary point is а 
local extreme point in the case of a differentiable function which uses only the first derivative 
of function f. 


THEOREM 4.13 (first-derivative test for local extrema) Let function f : Dy — R be 
differentiable on the open interval (a,b) є Dy and хә є (a,b) be a stationary point of 
function f. Then: 


(1) Iff (x) > 0 for all x € (a*,xo) С (a, b) and f'(x) < 0 for all x € (xo, b*) С (a, Б), then 
хо is a local maximum point of function f . 

(2) Iff’(x) < 0 for all x € (a*,xo) С (a, b) and f'(x) > 0 for all x € (xo, b*) С (a, b), then 
хо is a local minimum point of function /. 

(3) Iff’(x) > 0 for all x € (a*,xo) С (a, b) and for all x € (xo, b*) С (a, b), then xo is not a 
local extreme point of function f. The same conclusion holds if f’(x) < 0 on both sides 
of xo. 
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For instance, part (1) of Theorem 4.13 means that, if there exists an interval (a*, b*) around xo 
such that function f is strictly increasing to the left ofxo and strictly decreasing to the right of xo 
in this interval, then хо is a local maximum point of function f . The above criterion requires the 
investigation of monotonicity properties of the marginal function off. The following theorem 
presents an alternative by using higher-order derivatives to decide whether a stationary point 
is a local extreme point or not provided that the required higher-order derivatives exist. 


THEOREM 4.14 (higher-order derivative test for local extrema) Let : Dy > R ben 
times continuously differentiable on the open interval (a,b) С Dy and xo € (a,b) be a 
stationary point. If 


f! (x0) =f" (xo) =f" (xo) = — f" PH) 20 and f(x) £0, 


where number n is even, then point хо is a local extreme point of function f, in particular: 


(1) Iff (xo) < 0, then function f has at point хо a local maximum. 
(2) Iff (xo) > 0, then function f has at point xo a local minimum. 


Example 4.20 We determine all local extreme points of function f : (0, оо) — R with 
1 
f(x) = = : In? 3х. 
х 
To check the necessary condition of Theorem 4.12, we determine 
1 
x 


1 1 1 
/'@) = — => In’ 3x + 2 In 3x - 208 zl - In3x(2 — In 3x). 


From equality f'(x) = 0, we obtain the following two cases which we need to consider. 
Casea Зх = 0. Then we obtain 3x = е? = 1 which yields x; = 1/3. 
Caseb 2 — In3x = 0. Then we obtain In 3x = 2 which yields x; = е2 /3. 


To check the sufficient condition for x; and x2 to be local extreme points according to 
Theorem 4.14, we determine f” as follows: 


2 1 1 1 
тә = (- 3 з) (2—1а3х)—-;.1п3х.—.3 
x x 3x 


Il 
| 
КАЕ ts | 


1 1 
-In3x+ Jr In 3x) 3 In3x 
x x 


- (1 — 31n 3x + In? 3x). 


In particular, we obtain 


1 1 54 54 
(б) eso and (5) а 3.2+22) = -75 <0. 


Hence, xı = 1/3 is a local minimum point with f (x1) = 0, and x2 = е? /3 is a local maximum 
point with f (x2) = 12/е2. 
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Example 4.21 A monopolist (i.e. an industry with a single firm) producing a certain 
product has the demand-price function (also denoted as the inverse demand function) 


p(x) = —0.04x + 200, 


which describes the relationship between the (produced and sold) quantity x of the product 
and the price p at which the product sells. This strictly decreasing function is defined for 
0 < x < 5,000 and can be interpreted as follows. If the price tends to 200 units, the number 
of customers willing to buy the product tends to zero, but if the price tends to zero, the sold 
quantity of the product tends to 5,000 units. The revenue R of the firm in dependence on the 
output x is given by 


R(x) = р(х) -x = (—0.04x + 200) -x = —0.04x* + 200x. 


Moreover, let the cost function C describing the cost of the firm in dependence on the 
produced quantity x be given by 


C(x) = 80x + 22, 400. 
This yields the profit function P with 
P(x) = R(x) — C(x) = —0.04x + 200x — (80x + 22,400) = —0.04x? + 120x — 22, 400. 


We determine the production output that maximizes the profit. Looking for stationary points 
we obtain 


P'(x) = —0.08x + 120 = 0 


which yields the point 
xp = 1,500. 
Due to 


P” (xp) = —0.08 < 0, 


the output xp = 1,500 maximizes the profit with P(1,500) = 67,600 units. Points with 
P(x) = 0 (i.e. revenue R(x) is equal to cost C(x)) are called break-even points. In the 
example, we obtain from P(x) = 0 the equation 


x? — 3, 000x + 560, 000 = 0, 
which yields the roots (break-even points) 


xı = 1,500 — 1,690,000 = 200 and x2 = 1,500+ 1,690,000 = 2, 800, 


i.e. an output x € (200; 2, 800) leads to a profit for the firm. Finally, we mention that when 
maximizing revenue, one gets from 


R'(x) = —0.08x + 200 = 0 
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the stationary point 
xg = 2,500, 


which is, because А” (2,500) = —0.08 < 0, indeed the output that maximizes revenue. 
However, in the latter case, the profit is only P(2, 500) = 27, 600 units. 


Example 4.22 The cost function С: R+ — R of an enterprise producing a quantity x of 
a product is given by 


C(x) = 0.1x? — 30x + 25,000. 
Then, the average cost function C, measuring the cost per unit produced is given by 


25,000 
Ca(x) = с = 0.1х — 30 + ———. 
х х 


We determine local minimum points of the average cost function Са. We obtain 


25,000 


Сиб) = 0.1 — ——у—. 


The necessary condition for a local extreme point is C; (х) = 0. This corresponds to 
0.1x? — 25,000 

which has the two solutions 
xı = 500 and x2 = —500. 


Since x2 ¢ Dc, the only stationary point is x} = 500. Checking the sufficient condition, we 
obtain 


25,000 
С (х)=2. 3 
and 
50, 000 
С” (500) = ————— > 0, 
125, 000, 000 


i.e. the produced quantity x = 500 minimizes average cost with C,(500) = 70. 


Example 4.23 A firm wins an order to design and produce a cylindrical container for 
transporting a liquid commodity. This cylindrical container should have a given volume Vo. 
The cost of producing such a cylinder is proportional to its surface. Let R denote the radius 
and H denote the height of a cylinder. Among all cylinders with given volume Vo, we want 
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to determine that with the smallest surface S (i.e. that with lowest cost). We know that the 
volume is obtained as 


Vo = V(R,H) = x RH, (4.2) 
and the surface can be determined by the formula 

S = S(R,H) = 2n R? + 2n RH. (4.3) 
From formula (4.2), we can eliminate variable H which yields 


Vo 
Н = =: (4.4) 


Substituting the latter term into formula (4.3), we obtain the surface as a function of one 


variable, namely in dependence on the radius А, since Vo is a constant: 


Vo Vo 
S = S(R) 22nR? -2nR. — —2| лк? - — \. 
(К) = 2лК + 20 -p (s +2) 


Now we look for the minimal value of S(R) for 0 < R < оо. For applying Theorems 4.12 
and 4.14, we determine the first and second derivatives of function S = S(R): 
Vo 2Vo 
S'(R) = 2 (әл — z) and S"(R) = 2 (m + c) 4 
Setting S'(R) = 0 yields 
2л — Vo = 0 (4.5) 


and thus the stationary point is obtained as follows: 


В = | w 
2л 


Notice that the other two roots of equation (4.5) are complex and therefore not a candidate 
point for a minimal surface of the cylinder. Moreover, we obtain 


S" (R1) = 2027 + 4л) = 12x > 0, 


i.e. the surface becomes minimal for the radius А. One can also argue without checking the 
sufficient condition that the only stationary point with a positive radius must be a minimum 
point since, both as R — 0 and as А — оо, the surface of the cylinder tends to infinity so 
that for А, the surface must be at its minimal value. Determining the corresponding value 
for the height, we obtain from equality (4.4) 


V V Vo: (2л)?/3 V V 
л Кї Jn x- V л 2л 
Л (от 
Thus, the surface of a cylinder with given volume is minimal if the height of the cylinder is 
equal to its diameter, i.e. H = 2R. 
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4.5.3 Convexity and concavity 


The definition of a convex and a concave function has already been given in Chapter 3. Now, 
by means of differential calculus, we can give a criterion to check whether a function is 
(strictly) convex or concave on a certain interval J. 


THEOREM 4.15 Let function f : Df — R be twice differentiable on the open interval 
(a, b) € Dy and let J = [a, b]. Then: 


(1) Function f is convex on J if and only iff” (х) > 0 forall x € (a, b). 
(2) Function f is concave on J if and only if/"(x) < 0 forall x € (a, b). 
(3) I£f" (x) > 0 for all x є (a, b), then function f is strictly convex on 7. 
(4) Iff" (x) < 0 for all x є (a, b), then function f is strictly concave on 7. 


Example 4.24 Let function f : IR — R with 
f(x) =ae™, a,bER, \ {0} 

be given. We obtain 
f'(x) = abeP* and (х) = abe”. 


For a,b > 0, we get f'(x) > 0 and f"(x) > 0 for all x € Dy, i.e. function f is strictly 
increasing and strictly convex. In this case, we say that function f has a progressive growth 
(since it grows faster than a linear function, which has a proportionate growth). 

Consider now function g : Ry — R with 


g(x) — aln(1 4 bx), a,b € R+ \ {0}. 
We obtain 


бу) = аЬ d "(су = аЬ? 
g = ERE anı et EO. 


For a,b > 0, we get g'(x) > 0 and g"(x) < 0 for all x € Dg, i.e. function g is strictly 
increasing and strictly concave. In this case, we say that function g has degressive growth. 


Example 4.25 We investigate function f : IR > R with 


2x 


f@) = 517 
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for convexity and concavity. We obtain 


jes > zen у, — 
Foes —2х(х? + 1)? — п 2.02 +1) - 2х 
G2 4 1). | -2«6? +)-@ — x) 45] 
i (х2 + 1)4 
EN x! — 3x 
(х2 +1)3 


To find the zeroes of the second derivative f”, we have to solve the cubic equation 

х? Зх = х. (x7 3) = 0 
which yields the solutions 

xı = 0, х = Мз апа хз = —A. 
Since e.g. /" (1) = 4: (1 — 3)/23 = —1 < 0, we have f” (x) < 0 for x € (x1,x2) = (0, V3), 
i.e. by Theorem 4.15, function f is strictly concave on the interval [0, МЗ]. Moreover, since 
we have distinct zeroes, the sign of the second derivative changes ‘at’ each zero, and by 


Theorem 4.15 we obtain: function f is strictly convex on [—4/3,0] U [V3, oo) and strictly 
concave on (—oo, —A/3] U [0, МЗ]. 


To decide whether a function is convex or concave, the following notion of an inflection 
point can be helpful. 


Definition 4.10 Let functionf : Dy — R be twice differentiable on the open interval 
(a,b) € Dy. Point ху € (a,b) is called an inflection point of function f when f 
changes at xo from being convex to being concave or vice versa, i.e. if there is an 
interval (a*, b*) С (a, b) containing xo such that either of the following two conditions 
holds: 


(1) f"(x) > 0 ifa* <x « xo and f" (x) € 0 ifxo « x < b* or 
(2) Р(х) <0 ifa* <x « xo and T (x) = 0 ifxo <x < Ё*. 


Consequently, if the second derivative changes the sign ‘at’ point хо, then point хо is an 
inflection point of function f. The notion of an inflection point is illustrated in Figure 4.7. 
Next, we give a criterion for an inflection point of function f. 
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Figure 4.7 Inflection point хо of function f. 


THEOREM 4.16 Let function f : Dy — К be n times continuously differentiable on the 
open interval (a, b) C Dy. Point xo € (a, b) is an inflection point of function f if and only if 


уо) =f" (x9 = =f" YG) =0 and fx) £0, 


where n is odd. 


Example 4.26 We consider function f : IR —> R with 
f(x) = х + 2х5 —12x2 420 

and determine inflection points. We obtain 
Р(х) = Ax? + 6x? — 24x and (х) = 12x? + 12x — 24. 

In order to solve f" (x) = 0, we have to find the roots of the quadratic equation 
х2 +х-2 = 0, 

which gives x; = 1 and x? = —2 as candidates for an inflection point. Using 
f" (x) = 24x + 12, 


we obtain f” (1) = 36 = 0 and f" (—2) = —36 Æ 0, i.e. both points x; = 1 and x; = —1 
are inflection points of function f. 
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4.5.4 Limits 


We have already discussed some rules for computing limits of sums, differences, products 
or quotients of functions. However, it was necessary that each of the limits exists, i.e. each 
of the limits was a finite number. The question we consider now is what happens when we 
want to determine the limit of a quotient of two functions, and both limits of the function in 
the numerator and the function in the denominator tend to co as x tends to a specific value хо. 
The same question arises when both functions in the numerator and in the denominator tend 
to zero as x approaches some value xo. 


Definition 4.11 Ifin a quotient both the numerator and the denominator tend to zero 
as x tends to xo, we call such a limit an indeterminate form of type *0/0", and we write 


The notion 'indeterminate form' indicates that the limit cannot be found without further 
examination. There exist six further indeterminate forms as follows: 


*oo/oo", “0 b оо”, “со Et оо”, “00”, “ool”, sep eo 


For the two indeterminate forms “0/0” and “оо/оо”, the limit can be found by means of the 
following rule. 


THEOREM 4.17 (Bernoulli-l Hospital's rule) Let functionsf : Df > Randg : D, > К 
both tend to zero as x tends to xo, or f and g both tend to оо as x tends to xo. Moreover, let 
f and g be continuously differentiable on the open interval (a, b) € Dy N Dg containing хо 
and g'(x) Æ 0 for x є (a, b). Then 


p [eg 49 


x—> xo g(x) x—> xo g' (x) ~ 


Here either the limit exists (i.e. value L is finite) or the limit does not exist. 


It may happen that after application of Theorem 4.17, we still have an indeterminate form 
of type “0/0” or “oo/oo”. In that case, we apply Theorem 4.17 repeatedly as long as we 
have found L. Moreover, we mention that Theorem 4.17 can be applied under appropriate 
assumptions also to one-sided limits (i.e. x — хо + 0 or x — хо — 0) as well as to the cases 
x — —oo and x — oo. To illustrate Theorem 4.17, consider the following example. 


Example 4.27 We determine the limit 


р esin PA е2х 
І = lim — WH, 
x>0 sin 2x — 2x 
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which is an indeterminate form of type “0/0”. Applying Theorem 4.17 repeatedly three times, 
we obtain 


Т: (e822 cos 2х — е2) i e8in 2X cog 2x — е?^ “0/0” 
E 2cos2x — 2 "x50 соз2х—1 
| 2 fem (cos? 2x — sin 2x) e™] » $5 
= lim : 0/0 
x0 —2 sin 2x 
" 2eSin 2x cos 2x(cos? 2x — sin 2x) ein 2x (_4 cos 2x - sin2x — 2 cos 2x) — 2e” 
= hm 
x0 —2 cos 2x 
= 15 


In the above computations, we have to apply Theorem 4.17 three times since after the first 
and second applications of Theorem 4.17, we still have an indeterminate form of type “0/0”. 


We note that all other indeterminate forms can be transformed to one of the forms “0/0” or 
*oo/oo", and they can be treated by Theorem 4.17, too. We discuss these transformations in 
some more detail. 


Let 


lim g(x) = 0 and lim f(x) = 
x—> xo x>X0 


which corresponds to an indeterminate form of type “0. oo". Then we can reduce the latter 
case to one discussed above by considering the reciprocal expression of one function, i.e. we 
use either 


lim [f(x)-g(x)] = lim a) 
Xx— xo x xo f 
Or 
йш Fæ- ga] = dim Iu 
uo 


The first limit on the right-hand side is an indeterminate form of type “0/0”, and the second 
one is an indeterminate form of type “оо/оо”. 


In the case of an indeterminate form of type “oo — 00”, i.e. 
lim [g(x) — f 6] with lim g(x) = оо and lim f(x) = 
Xx— xo x—x0 х x0 


we can apply the following general transformation: 


1 1 SI. d 
(x) (x) 

90) -f@) = -q =, 
g(x) F(x) g GOf (x) 


where the right-hand side is an indeterminate form of type *0/0". However, we can often 
apply an easier reduction to obtain an indeterminate form that we have already discussed. 
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Example 4.28 We determine 


L= lim (v8 — 2x? — x) ; 


x— oo 


Obviously, this is an indeterminate form of type “оо — оо”. We reduce this form to the type 
“0/0” without the above general transformation. Instead of it, we perform for х з 0 the 
following algebraic manipulation: 


2x31 "NIE 
= oF |2012) -x=x(1-2) —x “oo — оо” 
х 


х 
2\13 . 
=х|{[1—-— — 1 “со.0 
x 
Thus we obtain 
1/3 
ИС 
lim ( x — 2x? — x) = lim == == 
х оо x— оо = 
x 


x 


—2/3 
1(1-2) ‚2 —2/3 
2 2 2 2 
parimi te M estin 6-3) [++ 


If we have an indeterminate form of type “0®”, “o0?” or “1 


has the structure 


œ” i.e. the limit to be determined 


L= lim gy” 
x— Xo 
with g(x) > 0, we take the natural logarithm on both sides of the equation 
ya) = go)f ? 
and obtain 
туб) = In [6/9 | 2 60 mg. 


If now x tends to xo, we obtain an indeterminate form of type “0. oo" which we can already 
treat. Suppose that this limit exists and let 


Ly = lim [reo mg], 


then we finally get 


L=e, 
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Notice that the latter is obtained from Theorem 4.4 since the exponential function is 


continuous. 


Example 4.29 We determine the limit 
L= lim (1 +x)" 

which is an indeterminate form of type “1™”. Setting 
уб) = +x)" 

and taking the natural logarithm on both sides, we obtain 
Iny(x) = :  In(1 + x). 

Applying Theorem 4.17, we get by Bernoulli-l' Hospital’s rule 


"E 6 Cy ok) o тж ое 1 
lim ———— = lim = 
x0 x x>0 1 x0 1+x 


Thus, we obtain 
lim Iny(x) = 1. 
x0 

Therefore, 


І = lim у(х) = el = e. 
x0 


4.5.5 Further examples 


In the following example, we investigate all the properties listed above when graphing 


functions. 


Example 4.30 Let us discuss in detail function f : Dy — IR with 


x? + 5x + 22 


га) = = 


Function f is defined for all x € R with x £ 2, i.e. Dy = R \ {2}. At point xo = 2, there is a 


pole of first order with 


ego: ME" с. 
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Figure 4.8 Graph of function f with f(x) = (x2 + 5x + 22)/(x — 2). 


From f (x) = x? + 5х + 22 = 0, we obtain the roots 


5 25 5 25 
Since 
25 63 
= -2=-= <0, 


there do not exist real zeroes of function f. We obtain for the first and second derivatives 


(2x + 5)(х – 2) — (х2 + 5х + 22) х2 – 4x – 32 


а (x — 2)2 T @— 2)2 


and 


(2x — 4)(x — 2)? — 2(х -2x – 4х – 32) 72 
(x — 2) КЕРН 


f"@) = 


Setting f’(x) = 0, we obtain x? — 4x — 32 = 0 which yields the stationary points 


x3=2+V4432=8 and X4 =2 — v4 +32 = —4. 
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From f" (хз) > 0, we get that хз = 8 is a local minimum point with f (x3) = 21, and from 
f" (x4) < 0, we get that x4 = —4 is a local maximum point with f (x4) = —3. Since f’(x) > 0 
for all x € (—oo, —4) О (8, оо), function f is strictly increasing on (—oo, —4] О [8, оо). 
For x € (—4,2) U (2,8), we have f’(x) < 0, and thus function / is strictly decreasing 
on [—4, 2) U (2, 8]. To determine candidates for an inflection point, we set f" (x) = 0. The 
latter equation has no solution, and thus function f does not have an inflection point. Because 
f" (x) > Oforx € (2, оо), function / is strictly convex on the interval (2, oo). Since f” (x) < 0 
for x € (—оо, 2), function f is strictly concave on the interval (—oo, 2). Moreover, 


lim f(x) = оо and lim f(x) = —oo. 
х 00 х —0о0 


The graph of function f is given in Figure 4.8. 


Example 4.31 Let the relationship between the price p of a good and the resulting demand 


D for this good be given by the demand function D = D(p), where 
400 — 10, 
=P б<р<40. (4.6) 
р+ 5 


First, we determine the extreme points of function D = D(p) and check the function for 
monotonicity as well as concavity or convexity. We obtain 
—10(p + 5) — (400 — 10p)-1 — 450 

(р + 5)? UU (р+5)7 


D'(p) = 


Thus D'(p) # 0 for all p € [0,40], and therefore function D = D(p) cannot have an 
extreme point since the necessary condition for its existence is not satisfied. Moreover, since 
D'(p) < 0 for all p € (0,40), function D = D(p) is strictly decreasing on the closed interval 
[0, 40]. Checking function D = D(p) for convexity and concavity, respectively, we obtain 


900 


——— > 0 
(р+5)3 7 


D'(p) = —450(р + 5) ?-(-2 = 
for all p € (0,40). Therefore, function D = D(p) is strictly convex on the closed interval 
[0, 40]. 

Next, we check function D — D(p) for elasticity. We obtain 
p: Dp) _ 450p(p + 5) 45p 


Dp) (р+ 5)2(400 — 10р)  (р+5)(40—р) 


ep(p) = 


We can determine points where function D = D(p) changes between being elastic and 
inelastic, i.e. we can check, where equality |ep(p)| = 1 holds and obtain 


45p =i 
(р+5)(40—р) 


Thus, 


(p + 5)(40 — p) = 45p 
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which yields the quadratic equation 
р? + 10р — 200 = 0 
with the two solutions 
р = —5 + v25 +200 = —5+15=10 and p;——5—15— —20 < 0. 


Solution p2 does not belong to the domain Dp of function D = D(p). Hence, for p = 10, 
function D is of unit elasticity. Since |ep(p)| < 1 for p € [0,10) and |ep(p)| > 1 for 
p € (10,40], we get that function D = D(p) is inelastic for p € [0,10) and elastic for 
p € (10,40]. 

Finally, we look for the function p — p(D) giving the price in dependence on the demand D. 
Since function D(p) is strictly decreasing on the closed interval [0, 40], the inverse demand 
function p — p(D) exists. Solving equation (4.6) for p, we obtain 


Dp + 5) = 400 — 10р 
p(D + 10) = 400 — 5D 
400 — 5D 
Dea m 

D+10 


Since D = D(p) is strictly monotone, the inverse function p = p(D) is strictly monotone as 
well. Because D(0) = 80 and D(40) = 0 (see also the inequality in (4.6)), function p = p(D) 
is defined for D є [0,80]. 


4.6 MEAN-VALUE THEOREM 


Before presenting the mean-value theorem of differential calculus, we start with a special 
case. 


THEOREM 4.18 (Rolle's theorem) Let function f : Dy — IR be continuous on the closed 
interval [a,b] € Dy with f (a) = f (b) = 0 and differentiable on the open interval (a, b). Then 
there exists a point x* є (a, b) with f'(x*) = 0. 


Geometrically, Rolle's theorem says that in the case of f(a) = f (b), there exists an interior 
point x* of the interval (a, b), where the tangent line is parallel to the x axis. In other words, 
there is at least one stationary point in the open interval (a, b). The mean-value theorem of 
differential calculus generalizes Rolle's theorem to the case f(a) Æ f (b). 


THEOREM 4.19 (mean-value theorem of differential calculus)  Letfunctionf : Dy > R 
be continuous on the closed interval [а,Ь] є Dy and differentiable on the open interval (a, Б). 
Then there exists at least one point x* є (a, b) such that 


f (b) — f (a) 


fen = 2 
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The mean-value theorem of differential calculus is illustrated in Figure 4.9. In this case, we 
have two points x} € (a, b) and x5 € (a, b) with the property that the derivative of function 
f at each of these points is equal to the quotient of f(b) — f(a) and b — a. 


The mean-value theorem of differential calculus can be used e.g. for the approximate 
calculation of function values. 


t 
* 


y ak 
a 21 15 b x 


Figure 4.9 Mean-value theorem of differential calculus. 


Example 4.32 Using the mean-value theorem, we compute 4/29. Let us consider function 
ІК. —> R with 


Хо) = dx. 
We get 
fiis lan 
3 : 


Applying ће mean-value theorem with a = 27 and b = 29, we get 


3 3 3 
ЛО) =) 19/29 2T 29-9 — о cT Ой 
jm 29 — 27 2 3 


which can be rewritten as 
1 
429 234 3 (x va. 


Using x* — 27 (notice that x* is not from the open interval (27, 29), but it can be chosen 
since function f is differentiable at x* = a = 27 and moreover, the value of function f" is 


186 Differentiation 


easily computable at this point), we get 


1 1 1 2 2 
OG RSs Е a El OTA 
*3'3pn *3 (33)2/3 +27 


Example 4.33 А vehicle starts at time zero and s(t) gives the distance from the starting 
point (in metres) in dependence on time / (in seconds). It is known that s(t) = 1?/2 and let 
a = 0 and b = 20. Then the mean-value theorem reads as 


s(20)—5(0  3-200—0 200 


E = = 10 = s(t) =. 
20—0 20 ag 44-89) 


The left-hand side fraction gives the average velocity of the vehicle in the interval [0, 20] 
which is equal to 10 m per second. The mean-value theorem implies that there is at least one 
value ¢* € (0,20), where the current velocity of the vehicle is equal to 10 m/s. In the above 
example, it happens at time ѓ* = 10. 


4.7 TAYLOR POLYNOMIALS 


Often it is necessary to approximate as closely as possible a ‘complicated’ function f by a 
polynomial P, of some degree n. One way to do this is to require that all n derivatives of 
function f and of the polynomial P,, as well as the function values of f and P,, coincide at 
some point x = xo provided that function f is sufficiently often differentiable. 


Before considering the general case, we discuss this question for some small values of n. 
First consider п = 1. In this case, we approximate function f around xo by a linear function 
(straight line) 


Pi(x) =f (xo) + f' (хо) : (x — xo). 


Obviously, we have Ру) = f (xo) and Р (хо) = f'(xo). For n = 2, we approximate 
function f by a quadratic function (parabola) P2 with 


Јо) 
2! 


Р(х) =f (x0) + f' (x0) - (х — хо) + (x — xo^. 


In this case, we also have Рә (xo) = f (хо), P5 (xo) =f’ (xo) and additionally P5 (xo) = f" (xo). 
Suppose now that function f is sufficiently often differentiable, and we wish to approximate 
function f by a polynomial P, of degree n with the requirements listed above. 


THEOREM 4.20 Геї function f : Dy > R ben + 1 times differentiable on the open interval 
(a, b) € Dy containing points xo and x. Then function f can be written as 


РАА 20) ; 


"оо | 
n! 


c xj 29; 


(х= х0)? ee 


(x — xo)" + К, (х), 
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where 


SOTI (xo + А(х — x9) А 
R,(x) = к x prt. 0<A <1, 


is Lagrange’s form of the remainder. 


The above form of representing function f by some polynomial together with some remainder 
is known as Taylor s formula. We still have to explain the meaning of the remainder R, (x). 
From equation 


Rn(x) =f (x) — Pax), 


the remainder gives the difference between the original function f and the approximating 
polynomial P,,. We are looking for an upper bound for the error when approximating function 
f by polynomial P, in some interval J = (xo, x). To this end, we have to estimate the maximal 
possible value of the remainder R,(x*) for x* € (xo,x). (Note that xo + A(x — xo) with 
0 < à < 1 in R,(x) above corresponds to some point x* є (xo, х).) If we know that the 
(n + 1)th derivative of function f in the interval (xo, x) is bounded by some constant M, i.e. 


Ра) | M . forx* € (o,x), 
then we obtain from Theorem 4.20: 


| Rn(x) | < Е. ЕРИ (x — xo)"*l. (4.7) 
(n 4-1)! 


Inequality (4.7) gives the maximal error when replacing function f with its nth Taylor 
polynomial 


hn plk) 
P.) = У i» xp 
k=0 


in the open interval (xo, x). The above considerations can also be extended to an interval 
(x,xo) to the left of point хо. (Note that we have to replace (x — xo) in formula (4.7) by 
| x — xo |.) 


Example 4.34 Let function f : IR — R with 
Ха) = е 


be given. We wish to approximate this function by a polynomial of smallest degree such that 
for |x| < 0.5 the maximal error does not exceed 1074. We choose xo = 0 and obtain 


о) =) = fg e. 


Thus, we have f (0) = f’(0) = --- = f? (0) = 1. Consequently, 


3 


xl x x" 
foy St bap ese Ra) 
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with 
1 А 
К, (х) _/“ Ox) хн = == igre, 
(n+ 1)! (n+ 1)! 

where 0 < à < 1. For |х| < 0.5, we have 

ех < el? ~ 1.6487 < 2. 
Thus, we determine и such that 

2.|х|"+1! 2 
A < 1074. 


(+1 a+D 2M = 

From the latter estimation it follows that we look for the smallest value of n such that 
(п+1)!-2"+! > 2. 10°, 

or equivalently, 
(п + 1-2" > 104, 

For n € {0, 1, 2,3, 4}, the above inequality is not satisfied. However, for n = 5, we get 
(n+ 1)!-2” = 720-32 = 23,040 = 2.304 - 10^ > 10°. 


We have obtained: 
5 


gU. 


k=0 


Rs(x) = < 1074 for |x| < 0.5, 


and thus we can approximate function f with f(x) = е" for |х| < 0.5 by the polynomial Р; 
with 


2 3 4 5 


x x 
P = 1 
5(x) TET recta p 


and the error of this approximation does not exceed the value 107^. 


Example 4.35 Let us approximate function f : [—4, оо) —> R with 
f(x) = Vx £4 = (х + 49? 
by a linear function Р (х) around ху = 0. This approximation should be used to give an 


estimate of 4/4.02. 
We get 


/ а = 142 7 zd. _1 , —3/2 
=; (x + 4) and / у= ( z) (x + 4) , 
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Consequently, for xy = 0 we get 


1 1 
0) = 2 а 0) = = · = = 
700) ad /0)=5 5 
We obtain 
1 1 
f@)=2+—x- с (Ах+4) 3252, O<dr<1. fem 


4 8 


In order to estimate v 4.02, we write 4.02 = 0.02 + 4 and use representation (4.8). Estimating 
the maximum error, we get for x = 0.02 the inequalities 0 < Ax < 0.02 for 0 < A < 1, and 
consequently the inequality Ax + 4 > 4. Therefore, we have 


1 
(Ах + 4) 322 < 4732 = = 


and we get the following estimate: 


4 1 
— < 1075. 


im (0.021 = |а: 0.02 +4)—3/?.(—^ | 
ame ы; 100) | © 80,000 8 


We conclude that 


1 
V4.02 22 2 + 1 - 0.02 = 2.005 


with an error less than 1075. 


4.8 APPROXIMATE DETERMINATION OF ZEROES 


In this section, we discuss several algorithms for finding zeroes of a function approximately. 
As an application of Taylor polynomials we first consider the determination of zeroes by 
Newton s method. We discuss two variants of this method. 


The first possibility is to approximate function f about xo by its tangent at xo: 
f (x) © Pi(x) =f (xo) +f" (xo) - (x — xo). 


Let хо be an initial (approximate) value. Then we have f (x) ~ Р(х) = 0, from which we 
obtain 


f (xo) +f’ (xo) - (x — xo) = 0. 


We eliminate x and identify it with xı which yields: 


Now we replace value xo on the right-hand side by the new approximate value xı, and we 
determine another approximate value x2. The procedure can be stopped if two successive 
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approximate values x, and x,+1 are sufficiently close to each other. The question that we 
still have to discuss is whether specific assumptions have to be made on the initial value xo 
in order to ensure that sequence {x,} converges to a zero of function f. The first assumption 
that we have to make is хо € [a,b] with f(a) - f(b) < 0. In the case of a continuous 
function, this guarantees that the closed interval [a, b] contains at least one zero x of function 
f (see Theorem 4.6). However, the latter condition is still not sufficient. It can be proved 
that, if function f is differentiable on the open interval (a*, Б*) C Dy containing [а,Ь] and 
if additionally f'(x) 4 0 and f"(x) Æ 0 for x є [a,b] and xo € (a, b) is chosen such that 
f (xo) - f" (xo) > 0, then the above procedure converges to the (unique) zero x є [a,b]. 
The procedure is illustrated in Figure 4.10. 


Figure 4.10 Illustration of Newton's method. 


A second possibility is that we approximate function f about хо by a parabola. This procedure 
is denoted as Newton 5 method of second order. We get: 


f (x) © Pax) =f (x0) -- f' (xo) - (x — xo) + (x — xo^. 


f^). 
2! 


Let xo be an initial (approximate) value. Then we obtain from f (х) ~ P2(x) = 0 


f" (xo) | 
2! 


(x — xo? = 0. 


ДО) +f" (xo) < (x — хо) + 


If we eliminate x and identify it with x; we get: 


оо) EVOP — 2f Gf" б) 
(ДЕЎ | 


X14,1B = X0 
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Using now x; = x14 or x1 = хів (depending on which value is the better approximation 
of the zero), instead of ху on the right-hand side of the above equation, we determine a 
new approximate value x2 and so on. Again some assumptions have to be made to ensure 
convergence of sequence {xn} to a zero. It can be proved that, if function f is differentiable 
on the open interval (a*, b*) C Dy containing [a, b] and xo € [a,b] with f(a) -f (b) < 0 and 
f" (x) Æ 0 for x є [а,Ь], then the sequence {xn} determined by Newton's method of second 
order converges to a zero x є [а,Ь]. 


The above two variants of Newton's method use derivatives. Finally, we briefly discuss two 
derivative-free methods for determining zeroes numerically. The first procedure presents a 
general iterative algorithm, also known as a fixed-point procedure, which transforms the 
equation f(x) = 0 into the form x = ф(х). A solution x with x = ф(х) is denoted as 
fixed-point. Starting with some point xo € [a, b], we compute iteratively the values 


Xn+1 = $9 Xs), п = 0, 1, Sere 


The procedure stops when two successive values x; and хр аге ‘sufficiently close’ to each 
other. A sufficient condition to ensure convergence of this approach to a fixed point is that 
there exists a constant L € (0, 1) with 


eG) < 1, for all x є [а,Ь]. 


The second derivative-free method is known as regula falsi. In this case, we approximate 
function f between хо = a and x1 = b with f(a) -f (b) < 0 by a straight line through the 
points (a, f (a)) and (b, f (b)). This yields: 


b би 
-fa = LO gay. 
— а 


Since we look for a zero x of function f, we set y = 0 and obtain the approximate value 


b— 


a 
ОЕЕО =з 


for the zero. In general, we have f (x2) Æ 0. (In the other case, we have found the exact value 
of the zero.) Now we check which of both closed intervals [a, x2] and [x2, b] contains the 
zero. If f(a) - f (хә) < 0, then there exists а zero in the interval [a, x2]. We replace b by х2, 
and determine a new approximate value x3. Otherwise, i.e. if (хо) -f (b) < 0, then interval 
[x2, b] contains a zero. In that case, we replace a by x2 and determine then an approximate 
value хз, too. Continuing in this way, we get a sequence {x,} converging to a zero x є [а,Ь]. 
This procedure is illustrated in Figure 4.11. 
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Figure 4.11 Yllustration of regula falsi. 


Example 4.36 Let f : (0, оо) > R with 
f(x) =x-Igx-3 
be given. We determine the zero contained in the closed interval [3, 4] exactly to three decimal 


places. First we apply Newton’s method. We obtain 


f@=1-— ай = 


1 
х1п 10 x?1n 10` 


Obviously we have f (3) < 0, f (4) > 0, f'(x) > Oforx є [3,4] andf" (x) > Oforx є [3,4]. 
Using 


f Gn) Xn — lg x, —3 
Xn+1 = Xn 7 = Xn 1 3 n —0,1,..., 
f (Xn) 1— PN TET 


we get the results presented in Table 4.2 when starting with xy = 4. We obtain f (хо) < 1075, 
and the value given in the last row in bold face corresponds to the zero rounded to three 
decimal places. 


Table 4.2 Application of Newton's method 


n Xn f (xn) if (xn) 

0 4 0.39794 0.89143 
1 3.55359 0.00292 0.87779 
2 3.55026 0.00000 

3 3.550 
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Next, we apply the fixed-point procedure. We can rewrite the given function as 
x=Igx+3. 

Starting with хо = 3 and using the fixed-point equation 
Xn+1 = ф(хһ) = lg xn + 3, n=0,1,..., 


we obtain the results given in the second column of Table 4.3. We note that the sufficient 
convergence condition is satisfied in this case. From 


1 
(x) 2 ——— 10-2 
ф (х) indo and In 10 > 2, 


we obtain e.g. 
j 1 
[9 Oops est for x € [3,4]. 


We note that, if one rewrites the given function as x = 10*~?, the fixed-point method does not 
converge to the zero x. (The reader can easily verify that the sufficient convergence condition 
is not satisfied in this case.) 


Finally, we apply regula falsi. Letting хо = a = 3 and x1 = b = 4, we get the results 
presented in the third and fourth columns of Table 4.3. 


Table 4.3 Application of derivative-free methods 


n Fixed-point method Regula falsi 


Xn Xn f&n) 
0 3 3 —0.47712 
1 3.47712 4 0.39794 
2 3.54122 3.5452 — 0.0044 
3 3.54915 3.5502 — 0.00005 
4 3.55012 3.550 
5 


3.550 


ЕХЕКСІЅЕЅ 


41 Find the left-side limit and the right-side limit of function f : Dy — R as x 
approaches хо. Can we conclude from these answers that function f has a limit as 
x approaches xo? 


|a for x xg, | | vx for xxl m 
QI m I T LE NE 
x fo x«l 


x+1 fo х> 1? mc 


() Ло) =, xo =0; (d) fo) = | 
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4.2 


43 


44 


4.5 


4.6 


4.7 


4.8 


Find the following limits if they exist: 


3 2 3 2 3 2 
OK OP: а 30, y X — 3х 
Qm ege УУ ec UD ae 
What type of discontinuity is at point xo — 2? 
Check the continuity of function f : Dy > R at point х = хо and define the type of 
discontinuity: 


(а) уо) = AT, xo = 2; 
(b f(x) = |х – Ц, хо = l; 
e! for х <1 
ey Jc | 2х fo x> Т хо = 1; 
(d) Хо) = е0), xo = 1. 


Ате the following functions f : Dy — R differentiable at points х = xo and x = x1, 
respectively? 


(a) f(x) = |x — 5| + 6x, xo =5, x, =0; 
cos x for x <0; 
(b) о) = { 1+x for O<x<2, x =0, x, =2; 


2x+1 for x>2. 
Find the derivative of each of the following functions f : Dy — IR with: 


(а) у= 2х — 5х – З sinx + ѕіп(л/8); (b) у= (x^ + 4x) sinx; 


2 
(c) y= m (d у= (2x3 — 3x + In x)f; 
(e) у = cos(x? + 3х2 — 8)5; (f) y = cost (x + 3х2 — 8); 
(g) y= Vsin(e*); (h) у= (х2 + 1). 


Find and simplify the derivative of each of the following functions f : Dy > R 
with: 


@) f(x) = бапх – 1) соз; — (5) fe = ере, 


(с) fŒ = (1+ x). 


Find the derivatives of the following functions f : Dy — R using logarithmic 
differentiation: 
(a) f(x) = (tanx), Dy = (0,л/2); (b) f(x) =sinx*!, Dy = (1,00); 


_ (x-2Nx-1 
() fa) = 205571, 
Find the third derivatives of the following functions f : Dy — R 
with: 
(а) fG)-x'sx; (Б) f(x) = In’); 
2 2 
© Ла) = — O Л) = + Пе, 


(х — 2)3” 


4.9 


4.10 


4.12 


4.13 


4.14 
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Given the total cost function C : Ку — R with 
C(x) = 4x? — 2x? + 4x + 100, 


where C (x) denotes the total cost in dependence on the output x. Find the marginal cost 
at x; — 2. Compute the change in total cost resulting from an increase in the output 
from 2 to 2.5 units. Give an approximation ofthe exact value by using the differential. 


Let D : Dp — R+ be a demand function, where D(p) is the quantity demand in 
dependence on the price p. Given D(p) — 320/p, calculate approximately the change 
in quantity demand if p changes from 8 to 10 EUR. 


Given the functions f : Dy > R with 
f(x) 2287 

and g : Dg — R+ with 
g(x) = 34x 


find the proportional rates of change pp (x), pg (x) and the elasticities ғу (х) and £g (x) 
and specify them for хо = 1 and xı = 100. For which values x € Dy are the func- 
tions f and g elastic? Give the percentage rate of change of the function value when 
value x increases by one per cent. 


Given the price-demand function 
D = D(p) = 1,000e-20-1* 


with demand D > 0 and price p > 0, find the (point) elasticity of demand ep(p). 
Check at which prices the demand is elastic. 


Find all local extrema, the global maximum and the global minimum of the following 
functions f : Dy > R with Dy € [—5, 5], where: 


(а) f@)=xt-38+x?-5; (Ы) Л) = 4 |х – 31; 
2 


(с) f@=e"?; (d) Ло) = >: 
NEC 
© fe = рут. 


Assume that function f : Dy > R with 
f(x) =alnx+ bx? +x 


has local extrema at point xı = 1 and at point x? = 2. What can you conclude about 
the values of a and 5? Check whether they are relative maxima or minima. 
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4.15 


4.16 


4.17 


4.18 


4.19 


4.20 


Determine the following limits by Bernoulli-l'Hospital's rule: 


: 2(x—1) 2 
li SIn x. li е Xs 
(а) im =55; (b) lim "tO 
(c) lim (x7)!/*; (d lim x; 
x00 x—> 0+0 


. ПОЕ Б Ӯ -n I} sinx 
(e) Jim (5 sin x 4); (f) Tim Bo) In^ T7. 

For the following functions f : Dy — R, determine and investigate domains, zeroes, 
discontinuities, monotonicity, extreme points and extreme values, convexity and con- 
cavity, inflection points and limits as x tends to too. Graph the functions f with f (x) 
given as follows: 


x4) 3x? — 4х 
; b — ; 
Gay GLA XL 


4 3 
© fe -3—H5 O уа: 


(е) f(x) = n 37; (f уо) = У2х2 —э9. 


(а) f@)= 


Expand the following functions f : Dy — R into Taylor polynomials with corre- 
sponding remainder: 


(а) ЈО) = зіп F, m=2, п= 5; 
(D Хо) = әх +1), х= 0, neN; 
(с) (х) =e *sin2x, x = 0, n=4. 


Calculate 1 / {/е by using Taylor polynomials for functionf : Dy — Rwithf(x) = e. 
The error should be less than 1076. 


Determine the zero x of function f : Dy > R with 
f(x) =x? — 6x42 and 0<х<1 


exactly to four decimal places. Use (a) Newton’s method and (b) regula falsi. 
Find the zero x of function f : Dy > IR with 


f(x) = х= х - 3 and x> 1. 


Determine the value with an error less than 107? and use Newton's method. 


5 Integration 


In differential calculus we have determined the derivative f" of a function f . In many applica- 
tions, a function f is given and we are looking for a function F whose derivative corresponds 
to function f . For instance, assume that a marginal cost function С” is given, i.e. it is known 
how cost is changing according to the produced quantity x, and we are looking for the corre- 
sponding cost function C. Such a function C can be found by integration, which is the reverse 
process of differentiation. Another application of integration might be to determine the area 
enclosed by the graphs of specific functions. In this chapter, we discuss basic integration 
methods in detail. 


5.1 INDEFINITE INTEGRALS 
We start with the definition of an antiderivative of a function f . 


Definition 5.1 A function F : Dr — R differentiable on an interval J C Рр is called 
an antiderivative of the function f : Dy = Dr > R if 


F'(x) = f(x) for all x € Dr. 


Obviously, the antiderivative is not uniquely determined since we can add any constant 
the derivative of which is always equal to zero. In particular, we get the following 
theorem. 


THEOREM 5.1 If function F : Dr — R is any antiderivative of function f : Dy — R, then 
all the antiderivatives F* of function f are of the form 


F*(x) = F(x) + C, 


where C є R is any constant. 


By means of the antiderivative, we can now introduce the notion of the indefinite integral, 
which is as follows. 
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Definition 5.2 Let function F : Dr — К be an antiderivative of function f. The 
indefinite integral of function f , denoted by / f (x) dx, is defined as 


[теа Fc. 


where C є К is any constant. 


Function f is also called the integrand, and as we see from the above definition, the indefinite 
integral of function f gives the infinitely many antiderivatives of the integrand f. The notation 
dx in the integral indicates that x is the variable of integration, and C denotes the integration 
constant. 

The relationship between differentiation and integration can be seen from the following two 
formulas. We have 


d d 
HET — [F(x) + С] = f(x), 
x dx 
and 
Гео [roe roc. 


The first formula says that, if we differentiate the obtained antiderivative, we again obtain the 
integrand of the indefinite integral. In this way, one can easily check whether the indefinite 
integral has been found correctly. Conversely, if we differentiate function F and find then 
the corresponding indefinite integral with the integrand F’, the result differs from function 
F only by some constant C. 


5.2 INTEGRATION FORMULAS AND METHODS 


52.1 Basic indefinite integrals and rules 


From the considerations about differential calculus in Chapter 4, we are already able to 
present some antiderivatives. Their validity can be easily checked by differentiating the right- 
hand side, where we must obtain the integrand of the corresponding left-hand side integral. 
Unfortunately, it is not possible to find analytically an antiderivative for any given function. 
Determining an antiderivative for a given function is often much harder than differentiating 
the function. 


Some indefinite integrals 


ntl 
EL +С (neZ,nz-1) 


rl 
Га = ~ +C (re R, r+#-—l, x » 0) 
r 


ЕС In |x| + C (x # 0) 
x 


fea = gcc 
a 

[oa = ra (а> 0, а%+ 1) 
E = —cosx+C 
ПЕ = sinx+C 

ах л 
f С = tanx+C (x# 2 +kr, keZ) 

cos“ x 2 

dx 

TEP = —cotx+ C (x 2 kr, кє 2) 
| za inx + C 

———— = arcsinx 

м1 = х2 

ах 
152 = arctanx+C 


Next, we give two basic rules for indefinite integrals: 


(1) f k.f(x)dx— К. | f (x) dx (constant-factor rule); 
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(2) | [/'() + g(x)] dx = f f (x) dx + | g(x) dx (sum-difference rule). 
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Rule (1) says that we can write a constant factor in front of the integral, and rule (2) says that, 


if the integrand is the sum (or difference) of two functions, we can determine the indefinite 
integral as the sum (or difference) of the corresponding two integrals. Using the given list 
of definite integrals and the two rules above, we are now able to find indefinite integrals for 
some simple functions. 


Example 5.1 Given is the integral 


= f (28 +3*—2 sinx) dx. 


Applying the rules for indefinite integrals, we can split the integral into several integrals and 
solve each of them by using the list of indefinite integrals given above. We obtain: 


x4 


Dem ae 
ПИШЕ 


122 |as | засо [ sinx as 


х 


1 3 
—2(—cosx) + C = -xf + 


2 ; 
2 ing cosx + C 


Example 5.2 We wish to find 


«f 


— 2x2 
xe 
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Using power rules and the indefinite integral for a power function as integrand, we can 
transform the given integral as follows: 


[e [5-5 8) 


5716 8/3 
= | (*- 2x5/3) de = ——-2. tess i 2 ELC. 
3 


6 


The antiderivatives and rules presented so far are sufficient only for a few specific indefinite 
integrals. Hence, it is necessary to consider some more general integration methods which 
allow us to find broader classes of indefinite integrals. One of these methods is integration 
by substitution. 


52.2 Integration by substitution 


The aim ofthis method is to transform a given integral in such a way that the resulting integral 
can be easily found. This is done by introducing a new variable ¢ by means of an appropriate 
substitution f — g(x) orx — g7! (t). The integration method by substitution results from the 
chain rule of differential calculus. 


THEOREM 5.2 Suppose that function f : Df —> R has an antiderivative F and function 
g : Dy > R with Rg С Dy is continuously differentiable on an open interval (a,b) € Dj. 
Then function z = f o g exists with z = (f o g)(x) = f (g (x)) and setting t = g(x), we obtain 


Глав) ооа | pod = Fo +с - rae» c. 


The symbol o stands for the composition of functions as introduced in Chapter 3. Theorem 5.2 
states that, if the integrand is the product of a composite function f o g and the derivative 
g’ of the inside function, then the antiderivative is given by the composite function F o g, 
where function F is an antiderivative of function f . The validity of Theorem 5.2 can be easily 
proved by differentiating the composite function F o g (using the chain rule). 


Before considering several examples, we give some special cases of Theorem 5.2 with specific 
inside functions g: 


(1) Let g(x) = ax + b. In this case, we get g'(x) = a, and Theorem 5.2 reads as 
1 1 1 

[fat dae 1 | re -a dx = fro dt— -—-F(t)--C 
a a a 


1 
= —. Ё(ах + b)+ С. 
а 


Function g describes a linear substitution. 
(2) Let (9(х)) = [g(x)]". Then Theorem 5.2 turns into 


1 
| Оаа 
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(3) Let (9(х)) = 1/[g(x)]. Then Theorem 5.2 reads as 


/ 

| ое 
g(x) 

(4) Let f (g(x)) = е. Then Theorem 5.2 corresponds to the equality 


j= : g (x)dx = e? + C. 


We illustrate the above method by the following examples. 


Example 5.3 Let us find 

f (3x + 2)* ах. 
This integral is of type (1) above, i.e. we have g(x) = 3x + 2 and f (t) = tt. Using g'(x) = 3, 
we obtain 


1 1 б 


1 
[ot i= | вх+2%.34&= + |] #@= = +с 


1 
= зу 2)? : 
15 (3x +2)? +С 


Example 5.4 Consider the integral 
| 52е dx. 


Setting t = g(x) = x?, we obtain 


d 
P. = g'(x) = 3х2. 


The application of Theorem 5.2 yields (see also special case (4) above with g(x) = x?) 


5 5 5 5 
[sera = JE dx = ; еа = z +C= 7 + С. 


Example 5.5 We want to find the integral 


5 
[ss dx 
3x2 + 1 
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The first step is to split the above integral into two integrals: 


5x 4-8 5x 8 
—— d= | L—— d —— dx. 
Ln ii lra fan * 
We next discuss how we can find both integrals by substitution. The first integral is very 


similar to case (3) of Theorem 5.2. If we had 6x in the numerator, we could immediately 
apply this case. So, we multiply both the numerator and the denominator by 6 and obtain 


5x 5 6x 5 2 
СЕЗБЕ фай: aom. +1) +81. 


The second integral is similar to one of the basic integrals given in Chapter 5.2.1. If the 
integrand is 1/(1 + x”), then we obtain from this list of indefinite integrals the antiderivative 
arctan x. Thus we transform the second integral as follows: 


8 1 8 1 
L——— dx = 8 dx = dt. 
bea i | т ies | 


In the latter integral, we can apply the linear substitution t = 4/3 · x (or equivalently, we 
apply immediately the formula given for integrals of type (1) after Theorem 5.2). Now the 
indefinite integral can be found immediately from the list of given integrals: 


8 1 8 8 
—..] a = — arctan t + Cp = — arctan ( V3 - x) + C. 
Мз f Мз Ид ( ) 1 


Combining both results and rationalizing all denominators, we obtain 


5 8 5 8 
| 2 1 dx = 6 In(3x? +1)+ = УЗ arctan (v3-x) 4C. 


Sometimes we do not see immediately that the integrand is of the type f (g(x)) - g' (x) (or 
can be easily transformed into this form). If we try to apply some substitution t = g(x) and 
if, by using the differential dt = g'(x) dx, it is possible to replace all terms in the original 
integrand by some terms depending on the new variable t, then we can successfully apply 
integration by substitution. We illustrate this by the following examples. 


Example 5.6 We determine 


e2x 
IF 1 – ех “ 


and apply the substitution t = е". By differentiation, we get 


dt _ 
dx /— 


x 
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which can be rewritten as 


dt at 
dx = — = —. 
ех t 


Using the substitution and the latter equality, we can replace all terms depending on x in the 
original integral by some term depending on f, and we obtain 


f ex d f 2 dt f t dt f dt ЖО 
E Miam .— = = arcsin А 
1 — eX 1-2 t М1— t ~ 1 — 2 


The latter indefinite integral has been taken from the list in Chapter 5.2.1. Substituting back 
again, we get 


e2x 
Í Iu dx = arcsin e* + C. 


Example 5.7 Consider the integral 


| dx 
xvx —9 
We apply the substitution t = v x? — 9 and obtain by differentiation 
dt x 
ах 2—9 
which yields 
di а 
х 2—9 


Replacing dx/4/x? — 9 in the integral, we would still have x? in the denominator. In order to 
apply integration by substitution, we must be able also to replace this term by some term in 
the variable /. Using the above substitution again, we can replace x? in the denominator by 
solving equation t = vx? — 9 for x? which yields x? = t? + 9. Hence we obtain 


| dx | dt 5 | dt 1 i tic 
== ==: == arctan 
хх? — 9 2+9 9 (iy +1 3 3 


3 


1 x? — 9 
= 3 arctan =a +C. 


Notice that in the last step, when determining the indefinite integral, we have again applied 
a substitution, namely z = t/3 (or equivalently, type (1) of the integrals after Theorem 5.2 
has been used). 
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Example 5.8 Let us consider the integral 
dx 
sinx 
In this case, we can apply the substitution 


t * t 
ап = = 
2 


which can always be applied when the integrand is a rational function of the trigonometric 
functions sin x and cos x. Solving the above substitution for x yields 


x = 2 arctan t. 


By differentiation, we obtain 


dx _ 2 
dt 1+ 


Now, we still have to replace sin x by some function depending only on variable t. This can be 
done by using the addition theorems for the sine function (see property (1) of trigonometric 
functions in Chapter 3.3.3 and its special form given by formulas (3.8)). We have 


2. sin Х - cos *% 2 sin x - cos * 2 tan х 2t 
E 2 2 ш 2 25. М. 2 


1 sin? +cos?% 1+tan? 1427" 


To get the second to last fraction above, we have divided both the numerator and the 
denominator by cos*(x/2). Then, we get for the given integral 


d 1+2 22 dt 
i a zi Le; = = +С = ап +C. 
sin x 2t 1+2 і 2 


5.2.3 Integration by parts 


Another general integration method is integration by parts. The formula for this method is 
obtained from the formula for the differentiation of a product of two functions и and v: 


[w(x) - v(x)] = w(x) - v(x) + u(x) - v). 


Integrating now both sides of the above equation leads to the following theorem which gives 
us the formula for integration by parts. 


THEOREM 5.3 Letu: D, —> Randv: D, — R be two functions differentiable on some 
open interval J = (a,b) € D, N Dy. Then: 


[uw - v' (x) dx = u(x) - v(x) — l u (x) - v(x) dx. 
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The application of integration by parts requires that we can find an antiderivative of function v' 
and an antiderivative of function и - v. If we are looking for an antiderivative of a product of 
two functions, the successful use of integration by parts depends on an appropriate choice of 
functions и and v’. Integration by parts can, for instance, be applied to the following types 
of integrals: 


(1) ICE 
(2) | Р(х) - sin ax dx, 
(3) | Р(х) · cos ax dx, 
(4) f Р(х) - e dx, 


where P, (x) = anx” + ay 1x" 1 +... + aix + ag isa polynomial of degree n. 


In most cases above, polynomial P, is taken as function u which has to be differentiated within 
the application of Theorem 5.3. (As a consequence, the derivative u’ is a polynomial of smaller 
degree.) However, in case (1) it is usually preferable to take Р, as function v' which has to 
be integrated within the application of integration by parts (so that the logarithmic function 
is differentiated). We illustrate integration by parts by some examples. 


Example 5.9 Let us find 
[€ + 2) sinx dx. 


This is an integral of type (2) above, and we set 
u(x) =x +2 and v'(x) = sinx. 
Now we obtain 
u (x) = 2x and v(x) = — cosx. 
Hence, 
fe + 2) sinx dx = — (x? + 2) cosx — foz cosx) dx 
= —(x? + 2) cos x + 2 f xcosx dx. 
If we now apply integration by parts again to the latter integral with 
u(x) =x and v' (x) = cosx, 
we get 


u(x) = 1 and v(x) = sinx. 
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This yields 


|+» sinx dx = -6° +2)¢08%42(ssinx | sinx ar) 


= —(x? + 2) cosx + 2x sin x + 2 cosx + C 


2 


= 2x sinx — x^ cosx + C. 


Notice that integration constant C has to be written as soon as no further integral appears on 
the right-hand side. 


Example 5.10 Let us determine 


| Inx ах. 


Although the integrand f(x) = Inx is here not written as a product of two functions, we can 
nevertheless apply integration by parts by introducing factor one, i.e. we set 


u(x) = Inx and v (x) = 1. 
Then we obtain 
, 1 
и (х) = – and v(x) =x 
x 
which leads to 


[ахахах | m sas - nc. 


Example 5.11 We determine 


f sin? x dx. 


In this case, we set 

u(x) = sin x and v'(x) = sinx, 
and we obtain with 

u (x) = cosx and v(x) = — cosx 


by applying integration by parts 


ГЕ ах = = sinxeosx+ | cos? xdr. 
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Now one could again apply integration by parts to the integral on the right-hand side. However, 
doing this we only obtain the identity f sin? x dx = f sin? x dx which does not yield a solution 
to the problem. Instead we use the equality 

cos? x = 1 — sin? x 


(see property (8) of trigonometric functions in Chapter 3.3.3), and the above integral can be 
rewritten as follows: 


ГЕ dx = — sin x cosx + fo — sin? x) dx 
= —зїпхсовх+х — f зї? x dx. 


Now we can add / sin? x dx to both sides, divide the resulting equation by two, introduce 
the integration constant C and obtain 


1 
E dx — 5 (х — sin x cosx) + C. 


Often, one has to combine both discussed integration methods. Consider the following two 
examples. 


Example 5.12 We determine 


x 
| — dx. 
sin^x 


Although the integrand does not have one of the special forms (1) to (4) given after 
Theorem 5.3, the application of integration by parts is worthwhile. Setting 


u(x) =x and væ) = ———, 
sin^ x 
we get 
u(x) = 1 and v(x) = —cotx. 


Function v is obtained from the list of indefinite integrals given in Chapter 5.2.1. This leads to 


x 
ji 5 ах = -xotx | сох dx. 


sin^ x 


It remains to find the integral on the right-hand side. Using 
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we can apply integration by substitution. Setting g(x) = sinx the integrand takes the form 
g'(x)/g(x) (see case (3) after Theorem 5.2), and so we obtain 


[| x dx = —x cot x + In | sin x| + C. 


Example 5.13 We find 


/ sin ./x dx. 

First, we apply integration by substitution and set t = ./x. This gives 
dt 1 1 
ах De. 2t 


which can be rewritten as 
2t dt = dx. 


Replacing ./x and dx, we get 


[sin yrds =? [зш а. 


The substitution has been successfully applied since in the right-hand side integral, only terms 
depending on variable t and dt occur. This is an integral of type (2) given after Theorem 5.3, 
and we apply integration by parts. This yields 


u(t) = t and v (t) = sint 
and 

u (t) 21 and v(f) 2 — cost. 
Thus, 


2 f rsine dt =2- (сог + f cose ar) 


= 2 . (—t cost + sin t) + C. 


After substituting back, we have the final result 


зау =2. (—Vx - cos /x + sin /x) + C. 
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5.3 THE DEFINITE INTEGRAL 


In this section, we start with consideration of the following problem. Given is a function f 
with y = f(x) = 0 for all x € [a,b] € Dy. How can we compute the area A under the graph 
of function f from a to b assuming that function f is continuous on the closed interval [a, b]? 
To derive a formula for answering this question, we subdivide [а,Ь] into n subintervals of 
equal length by choosing points 


а= х) <x] € X2 <... € Xg—] < Xn = b. 


Let l; (resp. u;) be the point of the closed interval [x; ;,x;], where function f takes the 
minimum (maximum) value, i.e. 


f) = min{ f(x) | x € [xi-1. xi]}, 
fi) = max(f (x) | x € [xi-1,xi]]. 


and let Ax; = x; — x;_1. Note that for a continuous function, the existence of the function 
values f (/j) and f (uj) in the interval [x;_1,x;] follows from Theorem 4.5. 


Then we can give a lower bound А". and an upper bound Aj, in dependence on number n 


for the area A as follows (see Figure 5.1): 


Amin = qp ^ Axi; 


i=1 


Атах = X fu) - Axi. 


i=1 


а = To Tı X We. oet Ln—1 £n =b x 
Figure 5.1 The definition of the definite integral. 


Since for each x € [xi 1,xj], we have f (li) < f(x) x (ш), i € {1,2,...,n}, inequalities 
Ahi MR Es Ап ах hold. We observe that, if и increases (i.e. the lengths of the intervals 
become smaller), the lower and upper bounds for the area improve. Therefore, we consider 
the limits of both bounds as the number of intervals tends to co (or equivalently, the lengths 


of the intervals tend to zero). If both limits of the sequences (47... and {4hax} as n tends to 
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оо exist and are equal, we say that the definite integral of function f over the interval [a, 5] 
exists. Formally we can summarize this in the following definition. 


Definition 5.3  Letfunctionf : Df — К be continuous on the closed interval [a, b] € 


Dy. If the limits ofthe sequences (A47... ] and (47,,,) as n tends to oo exist and coincide, 
Le. 


lm AM = lm AS 
(6s dd оба шык j 


then J is called the definite (Riemann) integral of function f over the closed interval 
[a,b] € Dy. 


We write 


b 
py f (x) dx 


for the definite integral of function f over the interval [a, b]. The numbers a and b, respectively, 
are denoted as lower and upper limits of integration. The Riemann integral is named after 
the German mathematician Riemann. We can state the following property. 


THEOREM 5.4 Let function f : Dy — К be continuous on the closed interval [a,b] C Dy. 
Then the definite integral of function f over the interval [a, b] exists. 


We only mention that there exist further classes of functions for which the definite integral 
can be appropriately defined, e.g. for functions that are bounded on the closed interval [a, 5] 
having at most a finite number of discontinuities in [a, 5]. 


Note that the evaluation of a definite integral according to Definition 5.3 may be rather 
complicated or even impossible. There is a much easier way by means of an antiderivative 
of function f which is presented in the following theorem. 


THEOREM 5.5 (Newton-Leibniz's formula) Let function f : Dy — IR be continuous on 
the closed interval [a,b] С Dy and function F be an antiderivative of f. Then the definite 
integral of function f over [a, b] is given by the change in the antiderivative between x = a 
and x = b: 


b 
| тода = ғо) i = F(b) — F(a). 


From Theorem 5.5 we see that also for a definite integral, the main difficulty is to find an 
antiderivative of the integrand f . Therefore, we again have to apply one of the methods pre- 
sented in Chapter 5.2 for finding an antiderivative. In the following, we give some properties 
of the definite integral which can immediately be derived from the definition of the definite 
integral. 
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Properties of the definite integral 

o [rea - o 

о | OBL f (9 

Q) [eso = [уә dx (ЄЮ); 


b с Ь 
(4) [те a= | ro ac | jd eos 


b 
f f (x) dx 


Moreover, the following property holds: 


b 
(5) < if |f G1 dx. 


t 
f Ро) dx = ғо). = F(t) — F(a) = СО). 


Using the latter formulation, we obtain 


=f ae Gas реи C)=ftt 
A) Oe O= 7z FO + ]=f@. 


The first property expresses that the definite integral with a variable upper limit can be 
considered as a function G depending on this limit. The latter property states that the derivative 
of the definite integral with respect to the upper limit of integration is equal to the integrand 
as a function evaluated at that limit. 


THEOREM 5.6 Let functions f : Dy — R and g : Dg — К be continuous on the closed 
interval [a,b] € Dy N Dg with f(x) < g(x) for x є [а,Ь]. Then 


b b 
| ore | pore 


Theorem 5.6 is also known as the monotonicity property of the definite integral. In the 
derivation of the definite integral we have considered the case when function f is non- 
negative in the closed interval [a,b] C Ру. We have seen that the definite integral gives 
the area enclosed by the function f and the x axis between x = a and x = b. In the case 
when function f is negative in the closed interval [a, b], it follows from Theorem 5.6 that the 
definite integral has a negative value (using function g identically equal to zero, the result 
follows from the monotonicity property). In this case, the definite integral gives the area 
enclosed by the function f and the x axis between x = a and x = b with negative sign. As a 
consequence, if function f has zeroes in the interval [a, b], one has to split this interval into 
subintervals in order to get the area enclosed by the function and the x axis between x = a 
and x = b. 


We continue with some examples for evaluating definite integrals. The first example 
illustrates the above comment. 
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Example 5.14 We wish to determine the area A enclosed by function f with f(x) = cosx 
with the x axis between a = 0 and b = л. Function f is non-negative in the closed interval 
[0, 7/2] and non-positive in the closed interval [7 /2, 7]. Therefore, by using property (4) 
and the above comment, the area A is obtained as follows: 


5 л 
A= | cosx &- |, cosx dx 
0 m 


2 
— s 
=sinx| — sinx 
0 


2 

= (1-0) – 0-1) = 2. 
It is worth noting that we would get value zero when evaluating the definite integral of the 
cosine function between a = 0 and b = л (since the area between the cosine function and the 


x axis in the first subinterval is equal to the area between the x axis and the cosine function 
in the second subinterval). 


Example 5.15 We evaluate the definite integral 


f dx 
1 x(14+Inx) ` 


Applying integration by substitution, we set 
t—1-clnx 


and obtain by differentiation 


Inserting both terms into the integral, we get 


|. йх " ie dt 
1 x(1+Inx) z ta) t€ ` 
Thus we obtain 


10) dt t(2) 2 
| Z = 1n rl = 1а |1 +1nx|| = ln(1 + In2) — In(1 4- In 1) = In(1 + 1а 2). 
t) t t(1) 1 


In the above computations, we did not transform the limits of the definite integral into the 
corresponding f values but we have used the above substitution again after having found an 
antiderivative. Of course, we can also transform the limits of integration (in this case, we 
get t(1) = 1 + In1 = 1 and ¢(2) = 1 + In2) and insert the obtained values directly into the 
obtained antiderivative In |t|. 


Integration 213 


Example 5.16 The marginal cost of a firm manufacturing a single product is given by 


TEES 
C'@)=6- -y 0<x< 1000, 
x 


where x is the quantity produced, and the marginal cost is given in EUR. If the quantity 
produced changes from 300 to 400, the change in cost is obtained by Newton-Leibniz's 
formula as follows: 


400 
C(400) — C(300) = |. C' (x) dx 
300 


400 60 400 
= 6 — dx = (6x — 601n |x + 1|) 
300 x41 300 


~ (2,400 — 359.64) — (1,800 — 342.43) = 582.79 EUR. 


Thus, the cost increases by 582.79 EUR when production is increased from 300 units to 
400 units. 


Example 5.17 We want to compute the area enclosed by the graphs of the two functions 
fi: R— Капар : К > R given by 


AG) =х?—4 апа БО) = 2х—х?. 
We first determine the points of intercept of both functions and obtain from 
x*—4=2x—x? 
the quadratic equation 
x*—-x-2=0 
which has the two real solutions 
ху=—1 апа х = 2. 


The graphs of both functions аге parabolas which intersect only in these two points (—1, —3) 
and (2,0). To compute the enclosed area A, we therefore have to evaluate the definite 
integral 


2 
A= I о) — fi 69] dx 
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which yields 
2 2 
A= | [ох – х2) e -4| ax = | (—2x? + 2x + 4) dx 
—1 —1 


2 sar a 2 
zl ЕГЕС 
=] 3 2 —1 


ЕЕЕ 


Thus, the area enclosed by the graphs of the given functions is equal to nine squared units. 


THEOREM 5.7 (mean-value theorem for integrals) Let function f : Dy — R be contin- 
uous on the closed interval [a,b] С Ру. Then there exists a real number x* € [a,b] such 
that 


1 b 
M = f(t) = L— f f(x) dx. 


This theorem is graphically illustrated for the case f(x) > 0 in Figure 5.2. That is, there is at 
least one value x* € [a, b] such that the dashed area 


b 
r= | f(x) dx 


is equal to the area of the rectangle with the lengths b — a and function value f (x*) (where 
value x* must be suitably determined). 


(а), 


Ket A 
N 


OV 
СО о 
ОООО 


Figure 5.2 The mean-value theorem for integrals. 
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5.4 APPROXIMATION OF DEFINITE INTEGRALS 


There are several reasons why it may not be possible to evaluate a definite integral. For 
some functions, there does not exist an antiderivative that can be determined analytically. 
As an example, we can mention here function f with f(x) = e* , Which is often applied in 
probability theory and statistics, or function g with g(x) = (sin x)/x. Sometimes it may be 
too time-consuming to determine an antiderivative, or function f may be given only as a set 


of points (x, y) experimentally determined. 


In such cases, we want to determine the definite integral approximately by applying numerical 
methods. In the following, we present a few methods and give some comments on the 
precision ofthe approximate value obtained. Approximate methods divide the closed interval 
[a, b] into n subintervals of equal length ^ = (b — a)/n, and so we get the subintervals 
[хо, x1], [Dxi, x2]... [xn— 1, x4], where a = xo and b = xn. Within each interval, we now 
replace function f by some other function which is ‘close to the original one’ and for which 
the integration can easily be performed. In all the methods below, we replace function / by 
a polynomial of small degree. 


Approximation by rectangles 


In this case, we approximate function f by a step function (in each interval a constant function 
is used), i.e. we approximate the definite integral by the sum of the areas of rectangles. We 
obtain 


b b— 
/ fx) dx ғ — [f G9) t f 63) t E fon-)] = IR. 


The above approximation is illustrated in Figure 5.3. This formula gives a value between the 
lower and upper Riemann sum for a specific value of n. When applying approximation by 
rectangles, the error Ale of the approximate value of the definite integral can be estimated 
as follows: 


у 
ioci) max f'(x) 
n 


a<x<b 


b 
An - | f года zc. 


$ 


where c € [0, 1). 


Approximation by trapeziums 


An alternative approximation is obtained when in each closed interval [х;_1,х;] function 
f is replaced by a line segment (i.e. a linear function) through the points (xi—1,f (xi-1)) 
and (x;,f (x;)). This situation is illustrated in Figure 5.4. In this way, we get the following 
approximation formula for the definite integral: 


b b— b 
/ f(x) dx x И ut © t +/ Ол) +f) +f = Пк. 


By the above formula, we approximate the definite integral by the sum rr of the areas 
of n trapeziums. Assuming an equal number n of subintervals, often the approximation by 
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f(a), 


а = To Ti 29 T3 (e 44 In-2 Xn-1 Tn = b т 


Figure 5.3 Approximation by rectangles. 


f(x), 


а= їо X1 T? T3 2E In-2 In-1 Lp = Б x 


Figure 5.4 Approximation by trapeziums. 


trapeziums gives better approximate values than the approximation by rectangles. This is 
particularly true when the absolute value of the first derivative in the closed interval [a, 5] 


can become large and thus we may have big differences in the function values in some of the 
subintervals. 
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When determining the definite integral by the above approximation by trapeziums, we get 
the following estimation for the maximum error А/тв of the approximate value Jrg: 


b (b а)? 
| тода ns x NUT d 


Nim 
TE 120? 


max f" (x) 
axxxb 


From the above formula we see that the maximum error can become large when the absolute 
value of the second derivative of function f is large for some value in the closed interval 
[a, b]. Moreover, the smaller the interval length h is, the better the approximation. 


Kepler 5 formula 


Here we consider the special case when the closed interval [а, b] is divided only into two 
subintervals [a, (a + b)/2] and [(a + b)/2,6] of equal length. But now we approximate 
function f by a quadratic function (parabola) which is uniquely defined by three points, and 
here we use the points (a, f (a)) , ((a + b)/2,f ((a + b)/2)) and (b, f (b)) (see Figure 5.5). 


f(x), 


Figure 5.5 Kepler’s formula. 


This leads to Kepler’s formula: 


b b—a a+b 
| f(x) dx © E [ro +47 (>) «ro = Ik. 
a 
The error А/к of the approximate value J of the definite integral can be estimated as follows: 


(b а)? 


2.880 max f? (x) 


a<x<b 


b 
Aly = pi Года — I| = 
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The formula shows that a large absolute value of the fourth derivative for some value x є [а,Ь] 
can lead to a large error of the approximate value. If function f is a polynomial of degree no 
greater than three, then the value /к is even the exact value of the definite integral since for 
each values х € [а, Ё], the fourth derivative is equal to zero in this case. 


Simpson's formula 


This formula is a generalization of Kepler's formula. The only difference is that we do not 
divide the closed interval [a,b] into only two subintervals but into a larger even number 
n — 2m of subintervals. Now we apply Kepler's formula to any two successive intervals. 
This leads to Simpson's formula: 


b 
р 
| f(x) dx © uu : [f (a) +46 ол) + f (3) +: f 2m-1) 


+2(f (2) +f G4) + +++ +f G2m-2)) +f (5)] = Is. 


We can give the following estimate for the maximum error AJs of the approximate value Is: 


b (b — ay 
ль = | [ feodi асана in 


(4) 
180 max f^" (x) 


a<x<b 


Example 5.18 We illustrate approximation by rectangles, trapeziums and by Simpson’s 
formula by looking at the example 


6 6 
[| f@dx = Jj E dx, 
2 2 X 


where we use n — 8 subintervals for each method. We get the function values given in 
Table 5.1. 


Table 5.1 Function values for 
Example 5.18 


Xi feu) 


i 

0 2 3.6945 
1 2.5 4.8730 
2 3 6.6952 
3 3.5 9.4616 
4 4 13.6495 
5 4.5 20.0038 
6 5 29.6826 
7 5.5 44.4894 
8 6 67.2381 
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If we use approximation by rectangles, we get 


6 b— 
| f Gode PE уо) оа) 4f 69] 


: 132.5497 = 66.2749. 


ol A 


A^ 


Applying approximation by trapeziums, we get 


6 b— 
Í f G9) dx s — = |: aw M 0 а) 


^ 


4 
а 164.3215 = 82.1608. 


If we apply Simpson’s formula, we get 


6 ех b— 
Í mr dx 7 7 [Р (xo) + 4701) + 2700) + Af (x3) + 2f (x4) + Af (х5) + 2f (xe) 


4 
+47 (x7) +f (xg)] © 54` 486.2984 ~ 81.0497. 


5.5 IMPROPER INTEGRALS 


So far, we have made the following two basic assumptions in defining the definite integral 
of a function f over the interval [a, b]. 


(1) The limits of integration a and b are both finite. 
(2) The function is continuous and therefore bounded on the closed interval [a, b ]. 


If either of the conditions (1) and (2) is not satisfied, the definite integral is called an improper 
integral. We now consider both cases separately. 


5.5.1 Infinite limits of integration 


In this case, we define the improper integral appropriately as the limit of certain definite 
integrals with finite limits of integration. 


220 Integration 


Definition 5.4 Let one or both limits of integration be infinite. Then we define the 
improper integrals as follows: 


oo b 
f Го) = im | лода 


b b 
| fo de = tim | f (x) dx 


oo b 
jl f (x)dx = jim [ fora 


a—>—oo 


provided that these limits exist. If a limit does not exist, the corresponding improper 
integral has no value and is said to be divergent. 


By using Newton-Leibniz's formula for definite integrals, we can evaluate the improper 
integrals given above as follows: 


Evaluation of improper integrals: 
оо b 
|| f(x) de = lim FG)| = lim F(b) — F(a) 
a b—oo а — boo 
b b 
| fG)dx— lim Fo) | = F(b)— lim F(a) 
—oo а —oo a а —oo 


oo b 
| fG)dx— lim Fo) | = lim F(b)— lim F(a) 
—oo boo a b—oo а> —oo 


а —oo 


Example 5.19 We evaluate the improper integral 


Ses 
j — e dx. 
о 3 


We obtain 


5.5.) Unbounded integrands 


Similarly to case (1), we define the improper integral again as a limit of certain definite 
integrals. 
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Definition 5.5 Let function f : Dy — R be continuous on the right-open interval 
[a, b) € Dy but definitely divergent as x tends to b — 0. Then we define the improper 


integral E f(x) dx as 


b t 
Jl f(x)d& = lim f f (x) dx 
a t—b—0 Ja 


provided that this limit exists. Otherwise the improper integral has no value and is said 
to be divergent. 
Let function f be continuous on the left-open interval (a, b] C Dy but definitely 


divergent as x tends to a + 0. Then we define the improper integral [р f (x) dx as 


b b 
|) Р(х) ах = Jim, f f (x) dx 


provided that this limit exists. Otherwise the improper integral has no value and is said 
to be divergent. 


Example 5.20 Ме evaluate the improper integral 


ce 
0 Vx 


Since function 1/4/x is definitely divergent as x tends to 0 + 0, we apply Definition 5.5 and 


obtain 
1 а. 1 2/3 | 3 1 
X С = " x Д 
== lim x Зах = lim >| = lim |>- x?/3 
о Ух 1040S; 010 2 Ir t>0+0\2 t 


з 3 ) 3 
= se dye 
(5 2 15040 ) 2 


Thus the above improper integral has the value 3/2. 


Finally, we consider an example where a point xo exists in the interior of the closed interval 
[a, b], where function f is definitely divergent as x tends to xo. This case can be reduced to 
the consideration of two integrals according to Definition 5.5. 


Example 5.21 Consider the improper integral 


f dx 
1 x2 
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This integrand has a discontinuity (pole of second order) atx = 0. Therefore we must partition 
the integral into two integrals, each of them with an unbounded integrand according to one 
of the cases presented in Definition 5.5, and we obtain 


[ dx й dx ji 1 dx 


1х2 520-0] х2 52040], x? 


ti 1 
+ lim [—- 
-1 -0+0\ x 
; 1 І 1 
== +1] + lm {1+ | = оо. 
1-0-0 ty 1504-0 t? 


Thus, this improper integral has no finite value. Notice that, if we did not take into considera- 
tion that the integrand is definitely divergent as x tends to zero and applied Newton-Leibniz's 
formula for evaluating a definite integral, we would obtain the value —2 for the above integral 
(which is obviously false since the integrand is a non-negative function in the interval so that 
the area cannot be negative). 
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5.6 SOME APPLICATIONS OF INTEGRATION 


In this section, we discuss some economic applications of integration. 


5.6.1 Present value of a continuous future income flow 


Assume that in some time interval [0, T] an income is received continuously at a rate of f (t) 
EUR per year. Contrary to the problems considered in Chapter 2, we now assume that interest 
is compounded continuously at a rate of interest i. Moreover, we denote by P(t) the present 
value of all payments made over the time interval [0, t]. In other words, the value P(t) gives 
the amount of money one would have to deposit at time zero in order to have at time T the 
amount which would result from depositing continuously the income flow f (t) over the time 
interval [0, Т]. 


In Chapter 2, we discussed the case of a discrete compounding of some amount. Let A be 
the amount due for payment after ¢ years with a rate of interest i per year. Then the present 
value of this amount A is equal to 


Р=4А.(1+07 


(see Chapter 2). In the case of m payment periods per year, the present value would be 


i —mt 
psa (iti) 7. 
m 


If we assume continuous compounding, we have to consider the question of what happens if 
m — oo. We set n = m/i. Notice that from m — oo it follows that n — оо as well. Now the 
present value P of the amount A is given by 


ZUM 
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Using the limit 


1 n 
lim (: + 1) = е, 
поо п 


we find that, in the case of a continuous compounding, the present value Р of amount A is 
given by 


P=A-e%. 


Now returning to our original problem, we can say that f (t) · At is approximately equal to 
the income received in the closed interval [t,t + At]. The present value of this amount at 
time zero is therefore equal to f (t) - At - e". Taking into account that this present value is 
equal to the difference P(t + At) — P(t), we obtain 


P(t+ At) — P(t) 


TT ж f (t) e. 


The left-hand side is the difference quotient of function P (considering points ѓ and t + At). 
Taking the limit of this difference quotient as At tends to zero, we obtain 


P'O rae. 


Evaluating now the definite integral of this integrand from 0 to T', we get 


T T 
P(T) — P(0) = f P'(t) dt = i fO. e ! dt. 
0 0 


Since P(0) — 0 by definition, we finally obtain 


T 
P(T) = f f «e dt. 
0 


Example 5.22 Let function f : Dy — R with f (t) = 30t + 100 (in EUR) describe the 
annual rate of an income flow at time f continuously received over the years from time t = 0 
to time t = 5. Moreover, assume that the rate of interest is i = 0.06 (or, which is the same, 
p = i: 100 per cent = 6 per cent) compounded continuously. Applying the formula derived 
above, the present value at time zero is obtained by 


5 
Po = | Gore 100) e a 
0 
Applying integration by parts with 
u(t) = 30t + 100 and v (t) = ett, 
we obtain 


1 
РА 3 | ,-0.06t 
u (t) = 30 and v(t) — 0.06 e Я 
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and thus 


5 30 [3 
+ e 0-06t dt 


1 
P(5) = ——— - (30t + 100) . e 906r 
(9) Р отоо 


0.06 


5 30 5 
= —— . (30t + 100) . e 996 e 0-06 
ly 062 
1 


30 \ 5 
—0.06t 

aoe . (30t + 100 + —— | 
0.06 ^ (зо m +o) 
1 


0 


0 


5 
—0.06t 
TUR - (30t 600) 
0.06 ^ Sora POE 


epee -e 93 . (150 + 600) + 20; . e? . 600 
0.06 0.06 
750 00 


750-03 , 000 — 239 77 EUR, 
0.06 0.06 


i.e. the present value at time zero is equal to 739.77 EUR. 


5.6.2 Lorenz curves 


These are curves that characterize the income distribution among the population. The Lorenz 
curve L is defined for 0 < x < 1, where x represents the percentage of the population 
(as a decimal), and L(x) gives the income share of 100x per cent of the poorest part of the 
population. Therefore, for each Lorenz curve, we have L(0) = 0, L(1) = 1,0 < L(x) € x, 
and L is an increasing and convex function on the closed interval [0, 1]. 


Then the Gini coefficient G is defined as follows: 


1 
G= 2 | [x — L(x)] dx. 
0 


Factor two is used as a scaling factor to guarantee that the Gini coefficient is always between 
zero and one. The smaller the Gini coefficient is, the fairer is the income distributed among 
the population. The Lorenz curve and the Gini coefficient are illustrated in Figure 5.6. They 
can also be used e.g. for measuring the concentration in certain industrial sectors. 


Example 5.23 Assume that for Nowhereland the Lorenz curve describing the income 
distribution among the population is given by 


3, 2 
L(x) = 5х + 5°? Dı = [0,1]. 


First we note that this is a Lorenz curve since L(0) = 0, L(1) = 1 and the difference 


3 2 3 3 3 
x—L(x) =x ЕЕ sr -.xd x) 
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is non-negative for 0 < x < 1 since all factors are non-negative in the latter product 
representation. (Notice also that function L is strictly increasing and strictly convex on the 
closed interval [0, 1].) We obtain 


1 1 3 3 
6=2 | [x — L(x)] a=2 | (3 x) dx 
0 0 5 5 


3 1 
= >x — Pi | = = 
10 15 0 5 


Since the Gini coefficient is rather small, the income is ‘rather equally’ distributed among 
the population (e.g. L(0.25) = 0.1375 means that 25 per cent of the poorest population still 
have an income share of 13.75 per cent and L(0.5) = 0.35 means that 50 per cent of the 
poorest population still have an income share of 35 per cent). 


Half Gini coefficient (€) 


2 


Lorenz curve L(x) 


Y 


Figure 5.6 Lorenz curve and Gini coefficient. 


5.6.3 Consumer and producer surplus 


Assume that for a certain product there is a demand function D and a supply function 5, 
both depending on the price p of the product. The demand function is decreasing while the 
supply function is increasing. For some price p*, there is an equilibrium called the market 
price, i.e. demand is equal to supply: D(p*) — S(p*). Some consumers are willing to pay 
a higher price than p* until a certain maximum price pmax is reached. On the other hand, 
some producers could be willing to sell the product at a lower price than p*, which means 
that the supply S(p) increases from the minimum price pmin. Assuming that the price p can 
be considered as a continuous variable, the customer surplus CS is obtained as 


Pmax 
cs- | рур, 
p* 
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while the producer surplus PS is obtained as 


ж 


р 
PS = f S(p) dp. 
Pmin 


The consumer surplus CS can be interpreted as the total sum that customers save when 
buying the product at the market price instead of the price they would be willing to pay. 
Analogously, the producer surplus is the total sum that producers earn when all products are 
sold at the market price instead of the price they would be willing to accept. The producer 
surplus PS and the consumer surplus CS are illustrated in Figure 5.7. 


Y 


P, min p* P max P 


Figure 5.7 Producer and consumer surplus. 


Example 5.24 Assume that the demand function D is given by 
D(p) = 5p? — 190p + 1,805 

and the supply function S is given by 
S(p) = 20p* — 160p + 320. 


First, we mention that we get the minimum price pmin = 4 from S(p) = 0 and the maximum 
price Pmax = 19 from D(p) = 0. Moreover, we note that the demand function is in fact 
strictly decreasing due to D'(p) = 10p — 190 < 0 for p < 19, and the supply function is 
strictly increasing since S’(p) = 40р — 160 > 0 for p > 4. The market price р“ is obtained 
from D(p*) = S(p*): 


5(p*)? — 190p* + 1,805 = 20(p*)? — 160p* + 320 


Integration 


which yields 
15(p*? + 30p* — 1,485 = 0, 
or, after dividing by 15, equivalently, 
(p*)? + 2p* — 99 = 0. 
From the latter quadratic equation, we get the zeroes 


pi =-14+V14+99=9 and pp =-1-V14+99=-11 
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Since the second root is negative, the market price is p* = рү = 9. Hence, the consumer 


surplus is obtained as 
19 19 
CS = Dp dp = f (5p? — 190p + 1,805) dp 

9 9 

3 2 19 5 19 

p р 3 2 
zs. — 100. + 1,805 | = (2. p> — 95р? + 1,805 | 
( 3 2* »), E mW г), 


_ (34,295 
B 3 


The producer surplus is obtained as 
9 9 
PS = / $(р) dp = (20р? — 160p + 320) ар 
4 4 


3 2 
P P 9 20 3 2 5 
—[20.— — 160. — + 320 | = | — -p` — 80р + 320, | 
( 3 2 ») | (5 5 p P 4 


1,280 
= 4, 860 — 6, 480 + 2, 880 — ( 3 


) = 833.33. 


— 34,295 + 34, 295) — (1,215 — 7,695 + 16,245) = 1, 666.67. 


EXERCISES 


5.1 Use the substitution rule to find the following indefinite integrals: 


(a) jee cosxdx; (b) [а ој —e 
e f 3x: e 22 о f ==. 
(g) ie LN Сас w f Ta of a 


of ө 
TE 2sinx + sin 2х. 


Бы 
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5.2 


5.3 


5.4 


5.5 


5.6 


Use integration by parts to find the following indefinite integrals: 


(a) [Peas e) | e cost © | ы dx; 


cos? x 


(d) f cos^xdx; (е) / x^Inxdx; (f) | In(x* + 1) dx. 


Evaluate the following definite integrals: 


of 24 o f = ө | ашха 
x X e "mm c sin” x dx; 
-1 о 1+ Xx 0 
^ хау t dx 1 2 
а ——; e f —, t; f sin x cos? x dx; 
Б 1 25 " о 2х—1 2 u^ 


0 dx 
(в) / 8 
-1 x“ + 2х + 2 
A firm intends to pre-calculate the development of cost, sales and profit of a new 


product for the first five years after launching it. The calculations are based on the 
following assumptions: 


e t denotes the time (in years) from the introduction of the product beginning with 
t= 0; 

ө C(t) = 1,000. [4 — 2e^/(e + 1)] is the cost as a function of є R,t > 0; 

e S(t) = 10,000 - £2 - ет are the sales as a function of t € R,t > 0. 

(a) Calculate total cost, total sales and total profit for a period of four years. 


(b) Find average sales per year and average cost per year for this period. 
(c) Find the total profit as a function of the time t. 


(a) Find 


2л 
| sin x dx 
0 


and compute the area enclosed by function f : IR — R with f(x) = sin x and the 
x axis. 

Compute the area enclosed by the two functions fı : R > R and fh : К > R 
given by 


(b 


— 


А0) = 4x and р(х) = 3x +6. 


The throughput q = q(t) (output per time unit) of a continuously working production 
plant is given by a function depending on time t: 


The throughput decreases for t = 0 up to t = 10 from qo to 0. One overhaul during the 
time interval [0, 7] with T < 10 means that the throughput goes up to qo. After that 
it decreases as before. 


5.7 


5.8 


5.9 
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(a) Graph the function g with regard to the overhaul. 

(b) Let ө = 4 be the time of overhaul. Find the total output for the time interval 
[0, T] with T > 4. 

(c) Determine the time fo of overhaul which maximizes the total output in the interval 
[0, Т]. 


Determine the following definite integral numerically: 


f dx 
0 12x 


(a) Use approximation by trapeziums with n — 10. 
(b) Use Kepler's formula. 
(c) Use Simpson’s formula with n = 10. 


Compare the results of (a), (b) and (c) with the exact value. 


Evaluate the following improper integrals: 


| f х dy; ef = о [еа 
2) уе т ge So ES 
оо 4 dx ? 2x -1 
2,—ÀX дү. ——* Fk туу 
of Ax^e ^ dx; © | T ч) 0 ара 2)“ 


Let function f with 
f (t) = 20t + 200 


(in EUR) describe the annual rate of an income flow at time ¢ continuously received 
over the years from time t = 0 to time t = 6. Interest is compounded continuously at 
a rate of 4 per cent p.a. Evaluate the present value at time zero. 


Given are a demand function D and a supply function S depending on price p as 
follows: 


8 4 
D(p) = 12 — 2p and SU аре, 


Find the equilibrium price p* and evaluate customer surplus CS and producer 
surplus PS. Illustrate the result graphically. 


6 Vectors 


In economics, an ordered n-tuple often describes a bundle of commodities such that the ith 
value represents the quantity of the ith commodity. This leads to the concept of a vector 
which we will introduce next. 


6.1 PRELIMINARIES 


Definition 6.1 А vector a is an ordered n-tuple of real numbers aj, a2,..., dp. 


The numbers aj, a2,..., an are called the components (or coordinates) of vector a. 
We write 
aj 
a2 
T 
a= . = (a1,a2,..., ag) 
ап 


for a column vector. For the so-called transposed vector al which is a row vector obtained 
by taking the column as a row, we write: 
T 
ai 
a2 
T 
a = (а1,42,...,ап) = 


an 


We use letters in bold face to denote vectors. We have defined above a vector a always as a 
column vector, and if we write this vector as a row vector, this is indicated by an upper-case 
T which stands for ‘transpose’. The convenience of using the superscript T will become 
clear when matrices are discussed in the next chapter. If vector a has n components, we say 
that vector a has dimension n or that a is an n-dimensional vector or n-vector. If not noted 
differently, we index components of a vector by subscripts while different vectors are indexed 
by superscripts. 
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Definition 6.2 The n-dimensional (Euclidean) space К” is defined as the set of all 
real n-tuples: 


RES а c de rios M2 н 


Similarly, R^. stands for the set of all non-negative real n-tuples. We can graph a vector 
as an arrow in the n-dimensional space which can be interpreted as a displacement of a 
starting point P resulting in a terminal point Q. For instance, if point P has the coordinates 
(P1, P2,- - - ,pq) and point О has the coordinates (q1,q2,..-,qn), then 


qı — P1 
а=РО= | ? P 
Qn — Pn 


It is often assumed that the starting point P is the origin of the coordinate system. In this 
case, the components of vector a are simply the coordinates of point Q and, therefore, a row 
vector al can be interpreted as a point (i.e. a location) in the n-dimensional Euclidean space. 
In the case ofn = 2, we can illustrate vectors in the plane, e.g. the vectors 


4 —1 
а= ( 3 | апа b= ( 2 ) 
are illustrated in Figure 6.1. Finding the terminal point of vector a means that we are going 


four units to the right and three units up from the origin. Similarly, to find the terminal point 
of b, we are going one unit to the left and two units up from the origin. 


X234 


-1 


Figure 6.1 Representation of two-dimensional vectors a and b. 
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Next, we introduce some relations on vectors of the same dimension. 


Definition 6.3 Let a,b є R” with a = (aj1,a2,...,a5)! and b = (bj, b2,..., bn)". 
The vectors a and b are said to be equal if all their corresponding components are 
equal, i.e. a; = bj for alli = 1,2,...,n. 

We write a < b ifa; < b; and a > b if aj > bj for all i = 1,2,...,n. Analogously, we 
write a < b ifa; < b; and a > b if a; > bj for all i = 1,2,...,n. 


Remark Note that not every pair of n-dimensional vectors may be compared by the relations 
< and >, respectively. For instance, for the vectors 


3 2 
а= | -1 and b=] 0 |, 
2 2 


we have neither a < b nor a > b. 


Example 6.1 Consider the vectors 


2 1 0 
a= 4 " b= —2 and с = 3 
3 3 2 


Then we have a > b, a > c, but vectors b and c cannot be compared, i.e. we have neither 
b < споге < b. 


Special vectors 


Finally, we introduce some special vectors. By e! we denote the ith unit vector, where the ith 
component is equal to one and all other components are equal to zero, i.e. 


0 
e=| 1 < ith component 


0 
The n-dimensional zero vector is a vector containing only zeroes as the и components: 


0 
0 


e 
ll 
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6.2 OPERATIONS ON VECTORS 


We start with the operations of adding two vectors and multiplying a vector by a real number 
(scalar). 


Definition 6.4 Let a,b є R” with a = (a1,a2,...,a5)] and b = (b1, b2,..., bn)". 
The sum of the two vectors a and b is the n-dimensional vector a + b obtained by 
adding each component of a to the corresponding component of b: 


а bi a, + bi 

a2 b2 az + b2 
a + b= : oF : = к 

an b, ап + bn 


Definition 6.5 Leta є R” with a = (a1,a2,...,a4)] and à є R. The product of 
number (or scalar) à and vector a is the n-dimensional vector Аа whose components 
are A times the corresponding components of a: 


ay Ла 

а2 har 
À dau À = } 

ап Xan 


The operation of multiplying a vector by a scalar is known as scalar multiplication. 


Using Definitions 6.4 and 6.5, we can now define the difference of two vectors. 


Definition 6.6 Let a,b є R” be n-dimensional vectors. Then the difference between 
the vectors a and b is defined by 


a— b = a + (—1)b. 


According to Definition 6.6, the difference vector is obtained by subtracting the components 
of vector b from the corresponding components of vector a. Notice that the sum and the 
difference of two vectors are only defined when both vectors a and b have the same dimension. 


Example 6.2 Let 
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Then we obtain 


п) m nmn D-0)-() 


Applying Definition 6.5, we obtain 


(DH) meme em (i) (F) 


The sum and difference of the two vectors, as well as the scalar multiplication, are geomet- 
rically illustrated in Figure 6.2. The sum of the two vectors a and b is the vector obtained 
when adding vector b to the terminal point of vector a. The resulting vector from the origin 
to the terminal point of vector b gives the sum a 4- b. 


X24 


Figure 6.2 Vector operations. 


We see that multiplication of a vector by a positive scalar à does not change the orientation 
of the vector, while multiplication by a negative scalar reverses the orientation of the vector. 
The difference a — b of two vectors means that we add vector b with opposite orientation to 
the terminal point of vector a, i.e. we add vector —b to vector a. 


Next, we summarize some rules for the vector operations introduced above. Let a, b, c є R” 
and A, и є R. 
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Rules for vector addition and scalar multiplication 


(1) a+b=b +a, Аа = а}; 
(commutative laws) 

(2) (a+b)+c=a+(b+e), (Аја = A(ua); 
(associative laws) 


(3) A(a+b) = Ха + Ab, (A+ u)a = àa + pa; 
(distributive laws) 

(4) a+0=a, а + (а) = 0; 
(0 € R”) 

(5) la=a. 


The validity of the above rules follows immediately from Definitions 6.4 and 6.5 and 
the validity of the commutative, associative and distributive laws for the set of real 
numbers. 


Definition 6.7 The scalar product of two n-dimensional vectors a = (a1, a2, . . . , an)" 
and b = (bi, b2,..., bn)" is defined as follows: 


bi 

T b2 : 

a -b= (а1,а2,...,аһ): ; = ajbi + ab; +... + anbn = Y aibi. 
ы {е1 
bn 


The scalar product is also known as the inner product. Note also that the scalar product of 
two vectors is not a vector, but a number (i.e. a scalar) and that a! - b is defined only if a 
and b are both of the same dimension. In order to guarantee consistence with operations in 
later chapters, we define the scalar product in such a way that the first vector is written as a 
row vector a! and the second vector is written as a column vector b. The commutative and 
distributive laws are valid for the scalar product, i.e. 


al.b—bl.a and a’. (b+c)=a'-b+al-c 


for a, b, c € IR". It is worth noting that the associative law does not necessarily hold for the 
scalar product, i.e. in general we have 


a- (bT - c) £ (a! - b).c. 


Example 6.3 Assume that a firm produces three products with the quantities xı = 30, 
x2 = 40 and x3 = 10, where x; denotes the quantity of product i. Moreover, the cost of 
production is 20 EUR per unit of product 1, 15 EUR per unit of product 2 and 40 EUR per 
unit of product 3. Let c = (c1, c2, сз) be the cost vector, where the ith component describes 
the cost per unit of product i and x = (x1,x2,x3)'. The total cost of production is obtained 
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as the scalar product of vectors c and x, i.e. with 


20 30 
c= 15 and х= | 40 
40 10 
we obtain 
30 
cT. x = (20,15,40). | 40 | =20-30+15-40+ 40-10 
10 


= 600 + 600 + 400 = 1, 600. 


We have found that the total cost of production for the three products is 1,600 EUR. 


Definition 6.8 Let a є R with a = (aj,a2,...,a5)]. The (Euclidean) length (or 
norm) of vector a, denoted by |a], is defined as 


lal = ya? +a +... ad. 


A vector with length one is called a unit vector (remember that we have already introduced 
the specific unit vectors e!,e?,...,e” which obviously have length one). Each non-zero 
n-dimensional vector a can be written as the product of its length |a| and an n-dimensional 
unit vector e pointing in the same direction as the vector a itself, i.e. 


a = |а|. e®. 


Example 6.4 Let vector 


—2 
a= 3 
6 


be given. We are looking for a unit vector pointing in the same direction as vector a. Using 


lal = /(—2)? + 32 + 62 = V49 = 7, 


we find the corresponding unit vector 


i 1, (2 32/7 
е0 = —.а=-.1 3 |—<[ 3/7 
lal T Y 6 6/7 


Using Definition 6.8, we can define the (Euclidean) distance between the n-vectors a = 
(а1,42,...,а)1 and b = (bj, b2,...,5,)! as follows: 


|a — Ы = y (а — b1)? + (a — b2)? +... + (an — Ь,)2. 
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The distance between the two-dimensional vectors a and b is illustrated in Figure 6.3. It 
corresponds to the length of the vector connecting the terminal points of vectors a and b. 


by ay X1 


Figure 6.3 Distance between the vectors a and b. 


Example 6.5 Let vectors 


3 -1 


be given. The distance between both vectors is given by 


la — b| = VB — (-DP + (2 — 1)2 + (-3— 5? = V81 = 9. 


Next, we present some further rules for the scalar product of two vectors and the length of a 
vector. Let a, b є R” and A є К. 


Further rules for the scalar product and the length 


(1) |a| = мат .а > 0; 

(2) jal =0 <= а=0; 

(3) [Аа| = |А: al; 

(4) la+ b| < [а| + |Ы; 

(5) a! -b = |а|. [Ы - cos(a, b); 

(6) |aT -b| < |а|. |b| (Cauchy-Schwarz inequality). 
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In rule (5), cos(a, b) denotes the cosine value of the angle between vectors a and b. We 
illustrate the Cauchy-Schwarz inequality by the following example. 


Example 6.6 Let 
2 5 
a= —1 апа b=]| —4 
3 —1 


For the lengths of vectors a and b, we obtain 


la| = V22 + (—1)2 + 32 = v14 апі Ы = V52 + (—4)? + (-1? = V42. 
The scalar product of vectors а and b is obtained as 
al.b 22.54 (-1)-(-4) -3-(-1) =104+4-3=11. 


Cauchy-Schwarz's inequality says that the absolute value of the scalar product of two vectors 
a and b is never greater than the product of the lengths of both vectors. For the example, this 
inequality turns into 


lay - b| = 11 < |а|: |Ы = V14- /42 ~ 24.2487. 


Example 6.7 Using rule (5) above, which can also be considered as an alternative equiv- 
alent definition of the scalar product, and the definition of the scalar product, according to 
Definition 6.7, one can easily determine the angle between two vectors a and b of the same 
dimension. Let 


Then we obtain 


al.b 
lal - |b| 
= 3-2+(-1)-1+2-2 EE AN 
Мз? +(—1)2+22./2+11+22 14.9 А14 


cos(a, b) = 


£ 0.80178. 


We have to find the smallest positive argument of the cosine function which gives the value 
0.80178. Therefore, the angle between vectors a and b is approximately equal to 36.7°. 


Next, we consider orthogonal vectors. Consider the triangle given in Figure 6.4 formed by 
the three two-dimensional vectors a, b and a — b. Denote the angle between vectors a and b 
by y. From the Pythagorean theorem we know that angle y is equal to 90? if and only if the 
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sum of the squared lengths of vectors a and b is equal to the squared length of vector a — b. 
Thus, we have: 


y = 90° <> la? + |b]? = |a — b? 
«= al.a+b'-.b=(a—b)!-(a—b) 
<> al.a+b!’-b=a!l-a—a!’-b—b!-a+b'-b 
<=» аї.Ь = 0. 
The latter equality has been obtained since a! - b = b! - a. For two-dimensional vectors 
we have seen that the angle between them is equal to 90° if and only if the scalar product is 
equal to zero. We say in this case that vectors a and b are orthogonal (or perpendicular) and 


write a L b. The above considerations can be generalized to the n-dimensional case and we 
define orthogonality accordingly: 


alb< >a!-b=0, 


where a,b є К”. 


a 


Figure 6.4 Triangle formed by vectors a, b and a – b. 


Example 6.8 The three-dimensional vectors 


3 4 
a= —1 апа b= 6 
2 —3 


are orthogonal since 


al.b 23.44 (-1)642-(—3) = 0. 
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63 LINEAR DEPENDENCE AND INDEPENDENCE 
In this section, we discuss one of the most important concepts in linear algebra. Before 


introducing linearly dependent and linearly independent vectors, we give the following 
definition. 


Definition 6.9 Let a’, i = 1,2,...,m, be n-dimensional vectors, and A;, i = 
1,2,...,m, be real numbers. Then the n-vector a given by 
m 
а = àja! + Аа? +... + Àma” = Уа (6.1) 


O А and ER 


in representation (6.1), then a is called a convex combination of the vectors 


al, a2,... , a". 


The set of all convex combinations of the two vectors a! and a? is illustrated in Figure 6.5. 
It is the set of all vectors whose terminal points are on the line connecting the terminal 
points of vectors a! and a?. Therefore, both vectors c! and c? can be written as a convex 
combination of the vectors а! and a?. Notice that for A; = 1 and à = 1 — A, = 0 we 
obtain vector al, whereas for A; = 0 and Az = 1 — А = 1 we obtain vector a2. Note 
that a convex combination of some vectors is also a special linear combination of these 
vectors. 


32 4 


Tı 


Figure 6.5 Set of convex combinations of vectors a! and a”. 
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Definition 6.10 The m n-dimensional vectors al,a?,...,a" є R” are linearly 


dependent if there exist numbers Л;, i = 1,2,...,m, not all equal to zero, such 
that 
m 
p» ма! = Ayal + Аа? +--+ Ama” = 0. (6.2) 
ill 
If equation (6.2) only holds when A; = А =... = A» = 0, then the vectors 
a!,a’,...,a” are said to be linearly independent. 


Since two vectors are equal if they coincide in all components, the above equation (6.2) 
represents n linear equations with the variables А, à2,..., Am. In Chapter 8, we deal with 
the solution of such systems in detail. 


Remark From Definition 6.10 we obtain the following equivalent characterization of 
linearly dependent and independent vectors. 


(1) A set of m vectors а!, a?,..., a" є R” is linearly dependent if and only if at least one 
of the vectors can be written as a linear combination of the others. 

(2) A set of m vectors a! , а2,...,а” є R” is linearly independent if and only if none of the 
vectors can be written as a linear combination of the others. 


Example 6.9 Let 


og) de guy 


In this case, we have a? — —3a! which can be written as 
3a! + 1a? = 0. 


We can conclude that equation (6.2) holds with A; = 3 and Az = 1, and thus vectors a! and 
a? are linearly dependent (see Figure 6.6). 


Example 6.10 Let 


e) m n) 


In this case, equation Aya! + A3? = 0 reduces to 


ЗА — А = 0 
А + 2А = 0. 
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This is a system of two linear equations with two variables л. and A2 which can easily be 
solved. Multiplying the first equation by two and adding it to the second equation, we obtain 
Ау = 0 and then Аз = 0 as the only solution of this system. Therefore, both vectors a! and 
b? are linearly independent (see Figure 6.6). 


123456782910 «m 


Figure 6.6 Linearly dependent and independent vectors. 


The above examples illustrate that in the case oftwo vectors ofthe two-dimensional Euclidean 
space IR2, we can easily decide whether they are linearly dependent or independent. Two 
two-dimensional vectors are linearly dependent if and only if one vector can be written as a 
multiple of the other vector, i.e. 


(see Figure 6.6). On the other hand, in the two-dimensional space every three vectors are 
linearly dependent. This is also illustrated in Figure 6.6. Vector c can be written as a linear 
combination of the linearly independent vectors a! and b?, i.e. 


c= àja! + А”, 
from which we obtain 
le — Aya! — А262 = 0. (6.3) 


By Definition 6.10, these vectors are linearly dependent since e.g. the scalar of vector c in 
representation (6.3) is different from zero. 


Considering 3-vectors, three vectors are linearly dependent if one of them can be written as a 
linear combination of the other two vectors which means that the third vector belongs to the 
plane spanned by the other two vectors. If the three vectors do not belong to the same plane, 
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these vectors are linearly independent. Four vectors in the three-dimensional Euclidean space 
are always linearly dependent. In general, we can say that in the n-dimensional Euclidean 
space IR", there are no more than п linearly independent vectors. 


Example 6.11 Let us consider the three vectors 


2 1 3 
а= | 0 |, а2= | 1 апа а? = А 
0 0 1 


and investigate whether they are linearly dependent ог independent. Using Definition 6.10, 
we obtain 


2 1 3 2A, + À2 + 3A3 0 
Лу 0 |+» 1 + Аз | 2 = № + 253 = 0 
0 0 1 Аз 0 


Considering the third component of the above vectors, we obtain Аз = 0. Substituting 
Аз = 0 into the second component, we get from Az + 243 = 0 the only solution А = 0 
and considering finally the first component, we obtain from 244 + А2 + 343 = 0 the only 
solution 4; = 0. Since vector АТ = (A1,42,43) = (0,0, 0) is the only solution, the above 
three vectors are linearly independent. 


Example 6.12 The set {e!,e*,...,e”} of n-dimensional unit vectors in the space IR" 
obviously constitutes a set of linearly independent vectors, and any n-dimensional vector 
al = (aj,a2,...,dn) can be immediately written as a linear combination of these unit 
vectors: 
1 0 0 
1 0 n | 
а=а|| . [+a] . |+ +a] . =) ше. 
К i : 1—1 
0 0 1 


In this case, the scalars of the linear combination of the unit vectors are simply the components 
of vector a. 
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6.4 VECTOR SPACES 


We have discussed several properties of vector operations so far. In this section, we introduce 
the notion of a vector space. This is a set of elements (not necessarily only vectors of real 
numbers) which satisfy certain rules listed in the following definition. 


As 


Definition 6.11 Given a set V = {a,b,c,...} of vectors (or other mathematical 
objects), for which an addition and a scalar multiplication are defined, suppose that the 
following properties hold (A, u € IR): 


(1) a+b =b +a; 
(2) (a+b) +c=a+(b+o); 
(3) there exists a vector 0 є V such that for all a є V the equation 
а + 0 = a holds (0 is the zero or neutral element with respect to addition); 
(4) for each a € V there exists a uniquely determined element x € V such that 


a + x = 0 (х = —a is the inverse element of a with respect to addition); 
(5) Ода = (ua); 
(6) l-a=a; 


(7) A(a- b) = ла + Ab; 
(8) (А+ u)a = ла + pa. 


If for any a, b є У, inclusion a+ b є V and for any à є R inclusion Aa є V hold, 
then V is called a linear space or vector space. 


mentioned before, the elements of a vector space do not necessarily need to be vectors 


since other ‘mathematical objects’ may also obey the above rules (1) to (8). Next, we give 
some examples of vector spaces satisfying the rules listed in Definition 6.11, where in each 
case an addition and a scalar multiplication is defined in the usual way. 


Examples of vector spaces 


(1) 
Q) 
(3) 
(4) 
(5) 


To 


the n-dimensional space R”; 

the set of all n-vectors a є R” that are orthogonal to some fixed n-vector b є R”; 
the set of all sequences {an}; 

the set C[a, b] of all continuous functions on the closed interval [a, b]; 

the set of all polynomials 


Р(х) = арх" + as ix" +: + aix + ад 
of a degree of at most n. 


prove this, one has to verify the validity of the rules (1) to (8) given in Definition 6.11 and 


to show that the sum of any two elements as well as multiplication by a scalar again gives an 


Vectors 245 


element of this space. For instance, consider the set of all polynomials of degree n. The sum 
of two such polynomials 


Р} (x) = anx” + арх" +--+ + ajx + ao 


P? (x) = bax” + by ax" | +--+ bix + bo 
gives 
P} (x) + Р(х) = (ag + by)x" + (ani + bn-1)x"T! +--+ + (a1 + b1)x + (ao + Бо), 


i.e. the sum of these polynomials is again a polynomial of degree n. By multiplying a 
polynomial Р! of degree n by a real number A, we obtain again a polynomial of degree л: 


АР} = (Aay)x" + (Аа„—1)х | +- -- + ax + Aag. 


Basis of a vector space; change of basis 


Next, we introduce the notion of the basis of a vector space. 


Definition 6.12 A setB = {b!,b’,...,b”} of linearly independent vectors of a vector 
space V is called a basis of V if any vector a є V can be written as a linear combination 


а = Ар +Agb? +---+A,b” 


of the basis vectors b!,b*,...,b”. The number п = |B| of vectors contained in the 
basis gives the dimension of vector space V. 


For an n-dimensional vector space V, we also write: dim V = n. Obviously, the set B; = 
(el, e,. . . , e") of unit vectors constitutes a basis of the n-dimensional Euclidean space R” 
(see also Example 6.12). This basis В, is also called the canonical basis. The notion of a 
basis of a vector space is a fundamental concept in linear algebra which we will need again 
later when discussing various algorithms. 


Remark (1) An equivalent definition of a basis is that a maximal set of linearly indepen- 
dent vectors of a vector space V constitutes a basis and therefore, the dimension of a 
vector space is given by the maximal number of linearly independent vectors. 

(2) We say that V is spanned by the vectors b! , b?, . . . , b" of B since any vector of the vector 
space V can be ‘generated’ by means of the basis vectors. 

(3) The dimension ofa vector space is not necessarily equal to the number of components of 
its vectors. If a set of n-dimensional vectors is given, they can contain at most n linearly 
independent vectors. However, any n linearly independent vectors of an n-dimensional 
vector space constitute a basis. 


Next, we establish whether an arbitrary vector of a vector space can be written as a linear 
combination of the basis vectors in a unique way. 
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THEOREM 6.1 Let В = (bl, b?,..., b") be a basis of an n-dimensional vector space V. 
Then any vector с € V can be uniquely written as a linear combination of the basis vectors 
from set B. 


PROOF We prove the theorem indirectly. Assume there exist two different linear combina- 
tions of the given basis vectors from B which are equal to vector c: 


с = АМЬ + à2b? +--+. + ànb” (6.4) 
and 
c = uj b! + ub? +--+ + unb”, (6.5) 


i.e. there exists an index i with 1 < i < n such that à; zz mi. By subtracting equation (6.5) 
from equation (6.4), we get 


0 = (Ay — ш)Ь! + (А — u2)b? + -+-+ An — Hn)b” 


Since the basis vectors b!,b*,...,b” are linearly independent by Definition 6.12, we must 
have 
Ay — ш = 0, À2 — 2 = 0, spy An — Un = 9, 


which is equivalent to 
AL = li, А = ua, eL An = Ш, 


i.e. we have obtained a contradiction. Thus, any vector of V can be uniquely written as a 
linear combination of the given basis vectors. [| 


While the dimension of a vector space is uniquely determined, the basis is not uniquely 
determined. This leads to the question of whether we can replace a particular vector in the 
basis by some other vector not contained in the basis such that again a basis is obtained. 
The following theorem shows that there is an easy way to answer this question, and from 
the proof of the following theorem we derive an algorithm for the replacement of a vector in 
the basis by some other vector (provided this is possible). The resulting procedure is a basic 
part of some algorithms for the solution of systems of linear equations or linear inequalities 
which we discuss in Chapter 8. 


THEOREM 6.2 (Steinitz's theorem) Let set В = {b!,b*,...,b”} be a basis of an 
n-dimensional vector space V and let vector а“ be given by 


ak cb + à2b? cd EA 


with Ак Æ 0. Then the set B* = (b!, b2,. .., b^! , аб, b**!, ..., b”} is also a basis of V, і.е. 
vector b contained in basis В can be replaced by the vector а“ to obtain another basis. 


PROOF Let us consider the following linear combination of the zero vector: 


pib! + uzb? +--+ ppb"! + ugat + шеъ +--+ nb” = 0. (6.6) 
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By substituting the linear combination 
аб = Aib! + Agb? + +--+ ASb" 
into equation (6.6), we obtain: 


(u1 + paa) b! + (u2 + uA)? + E Qua + ш)! 
+ Qua) b* + (шет + Bau) b. e sss + (un + Ukàn)b” = 0. 


Since the vectors of the set B = {b!, b?, . . . , b") constitute a basis, they are linearly indepen- 
dent and all the scalars in the above linear combination must be equal to zero, i.e. we get 


(ш + ик) = 0 for i=1,2,....n,1#k and икАк = 0. 


Since by assumption А; Æ 0, we get first и; = 0 and then, using the latter result, ш; = 0 
for alli with 1 < i < n, i Æ k, i.e. all the scalars in the linear combination (6.6) must be 
equal to zero. Hence, the vectors of set B* = {b!,b?,...,b“—!,a*, b*+!,..., b? are linearly 
independent by Definition 6.10 and they constitute a basis. a 


We know that any basis ofa certain vector space consists of the same number of vectors. So, if 
we want to remove one vector from the current basis, we have to add exactly one other vector 
to the remaining ones such that the resulting set of vectors is again linearly independent. 
We now look for a procedure for performing such an interchange of two vectors described 
in the proof of Theorem 6.2. To this end, assume that B = {b!,..., bi! pt pH. рл 
is a basis and B* = (bl,..., b*-l, аќ, Ь!,...,Ь"} is another basis, where vector bř has 
been replaced by vector аќ. According to Theorem 6.2, we must have Ад Æ 0 in the linear 
combination of vector 


a! = yb! 4 is Abe +---+A,b” (6.7) 


of the vectors of basis В since otherwise a replacement of vector b* by vector а“ is not 
possible. Let us consider an arbitrary vector с and its linear combinations of the basis vectors 
of bases B and В“, respectively: 


e— оь! +... ор ab"! + арЫ + oy yb t! +--+ anb” (6.8) 
and 
c= fib! +- + Ве 10! + Ba! + Вен +--+ + Bub". (6.9) 
By substituting representation (6.7) of vector а“ into representation (6.9), we obtain 
с = Bib! +--+ By ab! + Barb! АЫ + Anb”) 
+ Вены +--+ + Bub" 
= (Bi + Bia) b! d Ra + Boa) DY! + Вуд b* 
+ (Вен Baa) bt! + +--+ (Bn + Bean) b”. 
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Comparing now the scalars of the vectors from the basis В = {b!,b’,...,b”} in both 
representations of vector c (i.e in representation (6.8) and the last equality above) first for k 
and then for all remaining i Æ k, we first obtain 


Ok 
= m 


Bk (6.10) 


and then from 
oj = Bi + бк, і = 1,2,...,п, i Æ К, 


it follows Бу means of equality (6.10) 
Bi =a;-— о, i-l2,..nizk. 


In order to transform the linear combination of vector c of the basis vectors of B into the linear 
combination of the basis vectors of B*, we can therefore use the scheme given in Table 6.1. 
The last column describes the operation that has to be performed in order to get the elements 
of the current row: e.g. in row n + 2, the notation ‘row 2 — (A2/A,) row k’ means that we have 
to take the corresponding element of row 2 (i.e. o? in the с column), and then we have to 
subtract ^2 / A; times the corresponding element of row k (i.e. од in ће c column) which gives 
the new element 62 in row n + 2 and the c column. The transformation formula above for all 
rows different from k is also called the rectangle formula since exactly four elements forming 
arectangle are required to determine the corresponding new element. This is illustrated in the 
scheme above for the determination of element 62, where the corresponding four elements 
in row 2 and row K are underlined. 


Table 6.1 Tableau for Steinitz's procedure 


Row Basis vectors a* с Operation 

1 p! 1 01 

b? M 02 

к x: : ; 
E. M ak 
n b” Àn Qn 
n+1 b! 0 Bi =a – дор row 1 — $1 rowk 
n+2 b? 0 Во = оз – Fax row 2 — 12 row k 
i А : n Ри m 
n+k a 1 Bk = X. їр tow К 
2 n к= An | Ал 

п b 0 Bn = On — О town —  towk 
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In columns 3 and 4, we have the corresponding scalars of vectors а“ and c in the linear 
combinations of the basis vectors of B (row 1 up to row л) and of the basis vectors of B* 
(row n 4- 1 up to row 2n). In particular, from the first n rows we get 


a! = Aib! +... Ab А,Ь" 
and 
с= ojb! 4 --- Бор +--+ + anb”. 
From the last n rows, we get the linear combinations 
a’ = Ob! +... + 0b! + 1a* + Op! +... + 0b” 


and 


Л 1 ар \ k dn 
c= (a, = ар |b +---+(— ја ++ = —ay | b^. 
(m - Has) (= uy — Pay 


If we have representations of several vectors with respect to basis B and we look for their 
corresponding representations in the new basis B*, we simply add some more columns 
(for each vector one column) in the above scheme, and by performing the same operations as 
indicated in the last column, we get all the representations with respect to the new basis В“. 
The above procedure of replacing a vector in the current basis is illustrated by the following 
example. 


Example 6.13 Let a basis В = {b!, b*, b?, b^) of the four-dimensional space IR^ with 


1 3 —2 —1 
1] —1 2 —5 3 1 4 _ 0 
b = 1 zd 2 ‚ b= _2 and b'— _1 
—2 —1 2 1 


be given. We do not prove here that these four vectors indeed constitute a basis in IR^. 
According to Definition 6.10, we have to solve a system of four linear equations with four 
variables which we treat later in Chapter 8 in detail. Moreover, let 


а? = 4b! — 2p? + 2p? — 6b* 
be the vector which enters the basis instead of vector b?. In addition, let vector c be given as 


c = b! + 2b? + 4b? + ЗЬ“. 


Notice that 
—4 
c= z. | (6.11) 


7 


250 Vectors 


2 3 


i.e. the representation of this vector by means of the unit vectors e!, е2, e?, e^ as basis vectors is 
c = —4e! — 7e? — бе? + 7e*. 
Applying the tableau given in Table 6.1 to find the linear combination of vector c of the new 
basis vectors, we obtain the results given in Table 6.2. From row 5 to row 8 of the c column, 
we get the representation of vector c by means of the basis B* = (b!, b?, аз, b^): 
с = —7b! + 6b? + 2a? + 151°. 


Table 6.2 The change of the basis in Example 6.13 


Row  Basisvectors b? c Operation 

1 p! 4 1 

2 b2 -2 2 

3 b? 2 4 

4 bt -6 3 

5 b! 0 —7 rowl-2row3 
6 b2 (0) 6 row2+row3 

7 a? 1 2 1 row 3 

8 bt 0 15 row4+3row3 


We can easily check that our computations are correct: 


1 3 0 zii -4 
2, —5 8 0 —7 

e-—| | [+61 5 [+2] [жер _, рер |. 
-2 E -8 1 7 


i.e. for vector c we get the same representation with respect to the basis vectors e! , e?, e?, e^ 


as before (see equality (6.11)). 


If a basis B = {b!, b?,. .., b") should be replaced by a basis B* = (al, а2,..., a", then we 
can apply consecutively the above procedure by replacing in each step a vector b', 1 < i < n, 
by a vector a’, 1 <j < n, provided that the assumption of Theorem 6.2 is satisfied. 


EXERCISES 


6.1 Given are the vectors 


a= 1 : b= —4 and c= 2 


6.2 


6.3 


6.4 


6.5 


6.6 


6.7 
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(a) Find vectors a + b — c, a + 3b, b — 4a + 2c, a+ 3(b — 2c). 

(b) For which of the vectors a, b and c do the relations > or > hold? 

(c) Find the scalar products aT - b, aT - e, bT - c. Which of the vectors a, b and c are 
orthogonal? What is the angle between the vectors b and c? 

(d) Compute vectors (aT - b) - e and a - (bT . c). 

(e) Compare number |b + c| with number |b| + |c] and number |b! - e| with number 
[b] - |e]. 


Find о and В so that vectors 


[2 
a= —1 апа b= 4 
a —2 


are orthogonal. 


(a) What is the distance between the following points: (1,2,3) and (4, —1, 2) in the 
three-dimensional Euclidean space IR?? 
(b) Illustrate the following sets of points in R?: a > b and |а| > |Ы. 


Given are the vectors 


MeL) uu. elt): 


Find out which of the vectors 


"LORI as) 


are linear combinations of a! and a”. Is one of the above vectors a convex combination 
of vectors a! and a?? Graph all these vectors. 


Given are the vectors 


i) n) t) 9 n) 


Show that vector a^ can be expressed as a convex linear combination of vectors a! , a 
and a°. Find the convex combinations of vectors a! , a? and a? graphically. 


2 


Are the vectors 


1 1 5 
al = А а= | -2 and а? = 4 
0 1 -2 


linearly independent? 


Do the two vectors 


eG) om n (D) 


span the two-dimensional space? Do they constitute a basis? Graph the vectors and 
illustrate their linear combinations. 
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6.8 Do the vectors 


1 0 0 1 

0 0 1 0 

op П 0 ang 1 

1 0 1 0 

constitute a basis in IR^? 
6.9 Let vectors 

1 0 1 
а = | 0 |, a=] 1 and а? = 0 
3 0 —1 


constitute a basis in ЁЗ. 


(a) Express vector 


E 
Il 
w 


as a linear combination of the three vectors a!, a? and a? above. 


(b) Find all other bases for the three-dimensional space which include vector a and 
vectors from the set {a!, a2, a3}. 
(c) Express vector 


5 
р = 2а! + 2а2 + 3а? = | 2 
3 


by the basis vectors a!, a? and a. 
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7.4 MATRICES 


We start with an introductory example. 


Example 7.1 Assume that a firm uses three raw materials denoted by №, R2 and R3 to 
produce four intermediate products 51, 52, 53 апа 54. These intermediate products are partially 
also used to produce two final products F, and F2. The numbers of required units of the 
intermediate products are independent of the use of intermediate products as input for the 
two final products. Table 7.1 gives the number of units of each raw material which are 
required for the production of one unit of each of the intermediate products. Table 7.2 gives 
the number of units of each intermediate product necessary to produce one unit of each of 
the final products. 

The firm intends to produce 80 units of S1, 60 units of 52, 100 units of 53 and 50 units of 
$4 as well as 70 units of F; and 120 units of F2. The question is: how many units of the 
raw materials are necessary to produce the required numbers of the intermediate and final 
products? 


Table 7.1 Raw material requirements for the intermediate products 


Raw material $1 S2 S3 S4 
Ri 2 1 4 0 
Ro 3 2 1 3 
R3 1 4 0 5 


Table 7.2 Intermediate product requirements for the 
final products 


Raw material Fi Fy 
$1 2 3 
S2 4 0 
S3 1 4 
S4 3 1 
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To produce the required units of the intermediate products, we need 
2-80+1-60+4-100+0-50 = 620 

units of raw material Ау. Similarly, we need 
3-80+2-60+1-100+3-50= 610 

units of raw material Ry and 
1-80+4-60+0-100+5-50=570 


units of raw material R3. Summarizing the above considerations, the vector y? of the required 
units of raw materials for the production of the intermediate products is given by 


620 
y = | 610 |, 
570 


where the kth component gives the number of units of Rg required for the production of the 
intermediate products. 
Next, we calculate how many units of each raw material are required for the production of 
the final products. Since the intermediate products are used for the production of the final 
products (see Table 7.2), we find that for the production of one unit of final product F the 
required amount of raw material А is 

2.2+1:4+4:1+0.3 = 12. 
Similarly, to produce one unit of final product F2 requires 

2-34+1-04+4-44+0-1=22 
units of А. To produce one unit of final product F, requires 


3-2+2-4+1-1+3-3=24 


units of R2. Continuing in this way, we get Table 7.3, describing how many units of each raw 
material are required for the production of each of the final products. 


Table 7.3 Raw material requirements for the final 


products 
Raw material Fi Fo 
Ri 12 22 
R5 24 16 


Matrices and determinants 255 


Therefore, for the production of the final products, there are 
12.70 + 22 - 120 = 3, 480 

units of raw material А, 
24-70 + 16-120 = 3,600 

units of raw material R2 and finally 
33 -70 + 8-120 = 3,270 


units of raw material Ёз required. The vector y" containing as components the units of each 
raw material required for the production of the final products is then given by 


3, 480 
y =| 3,600 
3,270 


So the amount of the individual raw materials required for the total production of the inter- 
mediate and final products is obtained as the sum of the vectors y? and y^. Denoting this 
sum vector by y, we obtain 


620 3,480 4,100 
у= | 610 |+ | 3,600 |= | 4,210 
575 3,270 3, 845 


The question is whether we can simplify the above computations by introducing some for- 
mal apparatus. In the following, we use matrices and define operations such as addition or 
multiplication in an appropriate way. 


Definition 7.1 А matrix A is a rectangular array of elements (numbers or other 
mathematical objects, e.g. functions) aj; of the form 


a11 412 5 din 

a21 422 >t Adn 
А (aij) = 

Gm] Am2 атп 


Any element (or entry) aj; has two indices, a row index i and a column index j. The 
matrix A is said to have the order or dimension m x n (read: m by n). 
If m — n, matrix A is called a square matrix. 


For a matrix A of order m x n, we also write A = Ас), or A = (aij) (mp), or simply 
A — (aij). 
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Definition 7.2 Let a matrix A of order m x n be given. The transpose АТ of matrix A 
is obtained by interchanging the rows and columns of A, 1.e. the first column becomes 
the first row, the first row becomes the first column and so on. Thus: 


И) А =(а}) with card forl<j<nandl<i<m. 


Obviously, matrix АТ in Definition 7.2 is of order n x m. 


Example 7.2 Let 


Since matrix A is of order 2 x 4, matrix AT is of order 4 x 2, and we get 


2 7 
4: c 
А= |, % 
1 4 


Remark А vector 


ai 
a2 


am 


is a special matrix with one column (i.e. a matrix of order m x 1). Analogously, a transposed 
vector a! = (a), a2,...,dm) is a special matrix consisting only of one row. 


Definition 7.3 Two matrices A and В of the same order m x n are equal if 
corresponding elements are equal, i.e. 


aj = by forl<i<m and l<j<n. 


So only for matrices of the same order m x n can we decide whether both matrices are 
equal. 


Definition 7.4 А matrix A of order n x n is called symmetric if A = АТ, i.e. equality 
ау = aj; holds for 1 < i,j < n. Matrix А is called antisymmetric if A = cp i.e. 
ay = —dji for 1 < ij <n. 
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As a consequence from Definition 7.4, we obtain: if A is antisymmetric, then we must have 
а = 0 fori = 152,024: 


Special matrices 


We finish this section with some matrices of special structure. 


Definition 7.5 А matrix D = (dj) of order n x n with 


р=|. . Lue c t oe 
RUE : 0 fo 1<i,j<nandi Zzj, 
Q @ sss oh 


is called a diagonal matrix. A diagonal matrix I = (ij) of order n x n with 


ms ee cei, [1 fr Izüjznadi-j 
EIE ud e= jo for 1 <i,j <nandi Zj 
@ @ эз» i 


is called an identity matrix. 


Definition 7.6 А matrix U = (uj) of order n x n with 


un U2 сс Un 

0 u22 8: Un я. . . . H 
U-zI|. : з , Le. ну =Oforl <ij <nandi >j 

0 0 Unn 


is called an upper triangular matrix. A matrix L = (Ij) of order n x n with 


ly 0 E y 
li ж ess | 

L= ||. Ў d , Le. у 
ln 1,2 MS Гая 


is called а lower triangular matrix. 


Notice that the matrices given in Definitions 7.5 and 7.6 are defined only in the case of a 
Square matrix. 
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Definition 7.7 A matrix О = (о) of order m x n with 


0 0 0 
D («4 ae @ 

@ к= ||. s ‚|, ieoj-0 forl<i<mandl<j<n 
0 @ see @ 


is called a zero matrix. 


7.2 MATRIX OPERATIONS 


In the following, we discuss matrix operations such as addition and multiplication and their 
properties. 


Definition 7.8 Let A = (а) and B = (bj) be two matrices of order m x n. The sum 
А + B is defined as the m x n matrix (а; + bij), i.e. 


A+ В = (aij) mn) + (Dij) ma) = (aij + bij) тл): 


Thus, the sum of two matrices of the same order is obtained when corresponding elements at 
the same position in both matrices are added. The zero matrix O is the neutral element with 
respect to matrix addition, i.e. we have 


А+О= О+А = А, 
where matrix О has the same order as matrix А. 
Definition 7.9 Let A = (aj) be an m x n matrix and à € IR. The product of the scalar 
à and the matrix A is ће m x n matrix AA = (Aaj), i.e. any element of matrix A is 


multiplied by the scalar A. The operation of multiplying a matrix by a scalar is called 
scalar multiplication. 


Using Definitions 7.8 and 7.9, we can define the difference of two matrices as follows. 


Definition 7.10 Let A = (aj) and B = (bj) be matrices of order m x n. Then the 
difference of matrices A and B is defined as 


uc num 
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Consequently, matrix 4 — B is given by the m x n matrix (aj — bj), i.e. 


A — B = (aij) ma) — (by) (mn) = (aij — bij) т): 


Example 7.3 Let 


3 1 2 12 0 1 2 5 
А= |0 -2 3], в= |3 1 -1 and С= |2 0 3 
1 4 5 4 2 -2 2 -3 1 
We compute 24 + 3B — C and obtain 
2.3 2-1 2-2 3.1 3-2 3-0 
2A4+3B—C=]2-0 2.(-2 2-3]4+]3-3 3.1 3-(-1) 
2-1 2-4 2-5 3.4 3.2 3.(—2) 
1 2 25 
-|2 0 3 
2 —3 1 
6 2 4 3 6 0 1 2 5 
=|0 -4 6]+1]19 3 -31-|2 O 3 
2 8 10 12 6 —6 2 —3 1 
8 6 —1 
=| 7 -1 0 
12 17 3 


Next, we give some rules for adding two matrices A and B of the same order and for multi- 
plying a matrix by some real number (scalar multiplication). Let A, B, C be matrices of order 
mxnandA,u ER. 


Rules for matrix addition and scalar multiplication 


(1) A-B-B-cA; 
(commutative law) 

(2) (4-B)-C — A (В+ С); 
(associative law) 

(3) A(A4 + B) = AA + AB; (A+ дА = AA + uA. 
(distributive laws) 


We have already introduced the notion of a vector space in Chapter 6. Using matrix addition 
and scalar multiplication as introduced in Definitions 7.8 and 7.9, we can extend the rules 
presented above and get the following result. 


THEOREM 7.1 The set of all matrices of order m х n constitutes a vector space. 
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Next, we introduce the multiplication of two matrices of specific orders. 


Definition 7.11 Let А = (aj) be a matrix of order m x p and B = (Буу) be a matrix 
of order p x n. The product AB is a matrix of order m x n which is defined by 


апр +--+ + аррр апр +: apbp > апп +--+ арб 

421031 +: + a2pbpi — a21b12 +: + aopbpo ... 21 Din + +++ + a2pbpn 
АВ = 

agb +-+: Amp bp Ami b12 +: Amp bp2 o атп o GmpDpn 


Notice that the product AB is defined only when the number of columns of matrix A is equal 
to the number of rows of matrix B. For calculating the product A(mp)Bip,n), we can use Falk 5 
scheme which is as follows: 


bi bi2 >- bin 
bj bo) >+- ban 

AB ; 
bp) bp bpn 
ар ар c: Qip | C1 су с Cin 
а] a `` Qp | ор CO ++ Cn 
Aml Am2 `` Amp | Cm| Cm2 ^^^ Ст 


P 
with су = Y акьу fori = 1,2,...,m and j=1,2,...,n. 
k=1 


From the above scheme we again see that element cj; is obtained as the scalar product of the 
ith row vector of matrix A and the jth column vector of matrix B. 


If we have to perform more than one matrix multiplication, we can successively apply 
Falk's scheme. Assuming that the corresponding products of n matrices are defined, we 
can (due to the validity of the associative law) perform the multiplications either starting 
from the left or from the right. In the former case, we obtain C = 4142543 · · · А, according 
to C = [(4142)43] -- - An, i.e. by using Falk's scheme repeatedly we obtain 


К аш NN а: 
А\ 14142 1414243 |---| C * 


or in the latter case С = A41[42... (4, 145)], ie. we obtain by using Falk’s scheme 
repeatedly: 


A2 | A243... An 
E C 
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Next, we discuss some properties of matrix multiplication. 


(1) Matrix multiplication is not commutative, i.e. in general we have AB # BA. It may even 
happen that only one of the possible products of two matrices is defined but not the other. 
For instance, let 4 be a matrix of order 2 x 4 and B be a matrix of order 4 x 3. Then the 
product AB is defined and gives a product matrix of order 2 x 3. However, the product 
BA is not defined since matrix В has three columns but matrix А has only two rows. 

(2) For matrices Am p), Вер) and Сп), we have A(BC) = (AB)C, i.e. matrix multiplication 
is associative provided that the corresponding products are defined. 

(3) For matrices A (mp), Bipn) and Con), we have A(B + C) = АВ AC, i.e. the distributive 

law holds provided that B and C have the same order and the product of matrices A and 

B+ Cis defined. 

The identity matrix J of order n x n is the neutral element of matrix multiplication of 

square matrices of order n x n, i.e. 


(4 


МУ 


AI = IA = А. 


Let A Бе а square matrix. Then we write AA = 242, and in general A” = АА... А, where 
factor A occurs n times, is known as the nth power of matrix A. 


Example 7.4 Let matrices 


3 —2 6 
А= |4 1 3 and B= 
5 4 0 


— BN 
new 


be given. The product BA is not defined since matrix B has two columns but matrix A has 
three rows. The product AB is defined according to Definition 7.11, and the resulting product 
matrix C = AB is of the order 3 x 2. Applying Falk’s scheme, we obtain 


2 3 
AB 4 1 
1 5 

3 -2 6] 4 37’ 
4 1 3 [15 28 
5 4 0|26 19 


i.e. we have obtained 


4 37 
C—AB-|15 28 
26 19 


Example 7.5 Three firms 1, 2 and 3 share a market for a certain product. Currently, firm 1 
has 25 per cent ofthe market, firm 2 has 55 per cent and firm 3 has 20 per cent of the market. 
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We can summarize this in a so-called market share vector s, where component s; is a real 
number between zero and one giving the current percentage of firm i as a decimal so that 
the sum of all components is equal to one. In this example, the corresponding market share 
vector s = (51, 52,53) 1 is given by 


0.25 
s= | 0.55 
0.20 


In the course of one year, the following changes occur. 


(1) Firm 1 keeps 80 per cent of its customers, while losing 5 per cent to firm 2 and 15 per cent 
to firm 3. 

(2) Firm2 keeps 65 per cent of its customers, while losing 15 per cent to firm 1 and 20 per cent 
to firm 3. 

(3) Firm3 keeps 75 per cent of its customers, while losing 15 per cent to firm 1 and 10 percent 
to firm 2. 


We compute the market share vector s* after the above changes. To do this, we introduce 
a matrix Т = (tj), where fj is the percentage (as a decimal) of customers of firm j who 
become a customer of firm i within the next year. Matrix T is called a transition matrix. 
In this example, matrix T is as follows: 


0.80 0.15 0.15 
0.05 0.65 0.10 
0.15 0.20 0.75 


To get the percentage of customers of firm 1 after the course of the year, we have to compute 
si = 0.8051 + 0.1552 + 0.1553. 


Similarly, we can compute the values 55 and s3, and we find that vector s* is obtained as the 
product of matrix T and vector s: 


0.80 0.15 0.15 0.25 0.3125 
s*=Ts=|0.05 0.65 0.10 0.55 | = | 0.3900 
0.15 0.20 0.75 0.20 0.2975 


Hence, after one year, firm 1 has 31.25 per cent of the customers, firm 2 has 39 per cent and 
firm 3 has 29.75 per cent. 


Example 7.6 Consider again the data given in Example 7.1. Introducing matrix R5. 3,4) а5 
the matrix giving the raw material requirements for the intermediate products as in Table 7.1 
and matrix 57, 4,2) 88 the matrix ofthe intermediate product requirements for the final products 
as in Table 7.2, we get the raw material requirements for the final products described by 
matrix RB 2) by matrix multiplication: 


F S F 
R35, = R4) $45. 
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Let vectors x% 1) and хб D give the number of units of each of the intermediate and final 
products, respectively, where the ith component refers to the ith product. Then we obtain the 
vector y of the total raw material requirements as follows: 
5 Е 
У(3,1) = Yg, ) + Ya, 
_ ps S F F 
= Кд) "Xa + Ко) XD 


S S S F F 
= R4): Xa) + R,9 $42) ХО): 


The indicated orders of the matrices confirm that all the products and sums are defined. 


We now return to transposes of matrices and summarize the following rules, where A and В 
are m x n matrices, C is an n x p matrix and à є R. 


Rules for transposes of matrices 

а) 4" =4; 

(2) (4+ B)! = AT BT, (4 — B)" = AT — ВТ; 
(3) (24) = А47; 

(4) (Ac)? = СТАТ, 


Definition 7.12 А matrix А of order n x n is said to be orthogonal if АТА = І. 


As a consequence of Definition 7.12, we find that in an orthogonal matrix A, the scalar 
product of the ith row vector and the jth column vector with i 52 j is equal to zero, i.e. these 
vectors are orthogonal (cf. Chapte 6.2). 


Example 7.7 Matrix 


Ee 
2У3 2: 


is orthogonal since 


PAM Өү р 
z 1 1 1 ~\o 1 ZL 
Aye LJUS "d 


2 


7.3 DETERMINANTS 


Determinants can be used to answer the question of whether the inverse of a matrix exists 
and to find such an inverse matrix. They can be used e.g. as a tool for solving systems of 
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linear equations (this topic is discussed in detail in Chapter 8) or for finding eigenvalues (see 
Chapter 10). 


Let 
aj] 12 din 
421 422 an 
A= . 
аһ аһ аһ 


be a square matrix and 4; denote the submatrix obtained from А by deleting the ith row and 
jth column. It is clear that 4j; is a square matrix of order (n — 1) x (n — 1). 


Definition 7.13 The determinant of a matrix A of order n x n with numbers as 
elements is a number, assigned to matrix A by the following rule: 


n 
det A = |A| = У (-1)/*! - ay 141. 
j=l 


For n = 1, we define |A| = a11. 


Whereas a matrix of order m x n is a rectangular array of m - n elements, determinants 
are defined only for square matrices and in contrast to matrices, a determinant is a number 
provided that the elements of the matrix are numbers as well. According to Definition 7.13, 
a determinant of a matrix of order n x n can be found by means of n determinants of matrices 
of order (n — 1) x (n — 1). The rule given in Definition 7.13 can also be applied when the 
elements of the matrices are e.g. functions or mathematical terms. 


For n = 2 and matrix 


|4| = a11a22 — a12a21. 


For n — 3 and matrix 


|4| = an · [411 — 212 : |412| + a13 * 1413] 


421 422 
431 432 


„122 423 
432 азз 


‚|аз1 423 
a31 433 
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= a11 (422433 — 432423) — а12(а21азз — 431423) + а13 (021432 — 431422) 


411022033 + 412423431 + 413421432 — 411423432 — 412821033 — 413422431. 


The latter computations for n — 3 can be done as follows. We add the first two columns at 
the end as fourth and fifth columns. Then we compute the products of the three diagonals 
from the left top to the right bottom and add them, and from this value we subtract the sum 
of the products of the three diagonals from the left bottom to the right top. This procedure is 
known as Sarrus 5 rule (see Figure 7.1) and works only for the case n — 3. 


Determinants of square submatrices are called minors. The order of a minor is determined 
by its number of rows (or columns). A minor |Aj;| multiplied by (—1)/ is called a cofactor. 
The following theorem gives, in addition to Definition 7.13, an alternative way of finding 
the determinant of a matrix of order n x n. 


r Tad u^ 
4 a £ 
4 РА 4 
ал11 012 Q13 Q11 012 
b м £ м 4 ш 
` Nu T£ ГА 
x ZN x 4 
* 4 ^ a x y» 
a2 a22 Q23 Q21 022 
ГА х Pd Я P4 м 
^ MA y ` 
"d КАЧ М м. 
“ e hl # x x 
аз 32 Q33 Q31 Q32 
~ ` М 
` ` ^ 


Figure 7.1 Sarrus's rule. 


THEOREM 7.2 (Laplace's theorem, cofactor expansion of a determinant) Let A be a 
matrix of order n x n. Then the determinant of matrix A is equal to the sum of the products 
of the elements of one row or column with the corresponding cofactors, i.e. 


n 
|A| = Ус) ау 1А] (expansion of a determinant by row i) 
j=! 
and 
n 
|A| = ус) - ajj 1А] (expansion of a determinant by column j). 
i=1 


Theorem 7.2 contains Definition 7.13 as a special case. While Definition 7.13 requires 
cofactor expansion by the first row to evaluate the determinant of matrix A, Theorem 7.2 
indicates that we can choose one arbitrary row or column of matrix A to which we apply 
cofactor expansion. Therefore, computations are simplified if we choose one row or column 
with many zeroes in matrix A. 
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Example 7.8 We evaluate the determinant of matrix 


2 3 5 
A={ 1 02 
—4 2 


by applying Theorem 7.2 and performing cofactor expansion by the second column. We get 


— — 3. . 1 2 — 4. H 2 
Al = C1? :3 io j^ D*-0 E d 


5 
1 2 
= (—3) : [2 — (—2)] +0 +4- (4— 5) = -12+ 0 — 4 = —16. 


ау | ; 


According to Theorem 7.2, the first determinant of order 2 x 2 on the right-hand side of the 
first row above is the minor |412| obtained by crossing out in matrix A the first row and the 
second column, i.e. 


[412] = 


—-1 2" 


a» 423 
азі 433 


E 


Accordingly, the other minors 42? and 432 are obtained by crossing out the second column 
as well as the second and third rows, respectively. 


We now give some properties of determinants. 
THEOREM 7.3 Let A bean n x n matrix. Then |A| = |4T|. 


This is a consequence of Theorem 7.2 since we can apply cofactor expansion by the elements 
of either one row or one column. Therefore, the determinant of matrix A and the determinant 
of the transpose A! are always equal. In the case of a triangular matrix, we can easily evaluate 
the determinant, as the following theorem shows. 


THEOREM 7.4 Let A bean n x n (lower or upper) triangular matrix. Then 
n 
|A| = ап -a22 ·...: Ann = П. 
i=1 


As a corollary of Theorem 7.4, we find that the determinant of an identity matrix J is equal 
to one, i.e. |Z| = 1. 


If we evaluate a determinant using Theorem 7.2, it is desirable that the determinant has an 
appropriate structure, e.g. computations are simplified if many elements of one row or of 
one column are equal to zero. For this reason, we are looking for some rules that allow us to 
evaluate a determinant in an easier form. 


THEOREM 7.5 Let A be ann x n matrix. Then: 


(1) If we interchange in A two rows (or two columns), then we get for the resulting matrix 
А |4*| = —|А|. 
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(2) If we multiply all elements of a row (or all elements of a column) by à є R, then we get 
for the resulting matrix A*: |4*| = à - |А|. 

(3) If we add to all elements of a row (or to all elements of a column) A times the corre- 
sponding elements of another row (column), then we get for the resulting matrix А*: 
|A*| = 141. 


COROLLARY 7.1 For the n x n matrix B = AA, we obtain: |B| = |AA| = A” - JA]. 


The latter corollary is obtained by a repeated application of part (2) of Theorem 7.5. 


THEOREM 7.6 Let A and B be matrices of order n x n. Then |AB| = |A| - |B]. 


It is worth noting that in general |A + В| Z |A| + 18|. Next, we consider two examples of 
evaluating determinants. 


Example 7.9 We evaluate the determinant of matrix 


1 —2 4 3 
=2 5 —6 4 

4=| E E" 
4 —1 20 10 


We apply Theorem 7.5 to generate a determinant having the same value, in which all elements 
are equal to zero below the diagonal: 


1 -2 4 3| ПП —2 4 3) W -2 4 3 
4272 5 —6 4| |0 1 2 10 Jo 1 2 10 
К=з .4 16 sl—lo 24 -4^7|0 0 0 —4 
4 — 20 10) |0 -3 4 -2| |0 0 10 28 
1—2 4 3 
0 1 2 10 
=—lo о 10 28| 240. 
0 0 0 —24 


In the first transformation step, we have generated zeroes in rows 2 to 4 of the first column. 
To this end, we have multiplied the first row by 2 and added it to the second row, yielding 
the new second row. Analogously, we have multiplied the first row by —3 and added it to the 
third row, and we have multiplied the first row by —4 and added it to the fourth row. In the 
next transformation step, we have generated zeroes in rows 3 and 4 of the second column. 
This means we have multiplied the second row (of the second determinant) by —2 and added 
it to the third row, and we have multiplied the second row by 3 and added it to the fourth row 
(application of part (3) of Theorem 7.5). Additionally, we have interchanged rows 3 and 4, 
which changes the sign of the determinant. Finally, we applied Theorem 7.4. 
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Example 7.10 We want to determine for which values of ¢ the determinant 


3 NE: 
ldj2|2420 0 4 
1 2-0 


is equal to zero. We first apply expansion by column 3 according to Theorem 7.2 and obtain 


2+2t 0 3 1 3 Q 


412.) 12-1 1 2-1 2+2t 0 


ZH 


+0. 


—2.[Q4-20-(2—0]—4-(6—3t — 1) = –42 + 16t — 12. 


From |A| — 0, we obtain 
—4P? +162 – 12 = 0 
which corresponds to 
2 4+3=0. 


This quadratic equation has the two real roots / = 1 and t; = 3. Thus, for t; = 1 and h = 3, 
we get |A| = 0. To find the value of |A|, we did not apply Theorem 7.5. Using Theorem 7.5, 
we can transform the determinant such that we have many zeroes in one row or column 
(which simplifies our remaining computations when applying Theorem 7.2). Multiplying 
each element of row 1 in the initial determinant by —2 and adding each element to the 
corresponding element of row 2, we obtain 


3 1 2 
l4j2i-442t -2 ol. 
1 2—0 


In this case, when expanding by column 3, we have to determine the value of only one 
subdeterminant. We get 


—4+2t —2 


и=2| p 


which is equal to the already obtained value —4/ + 16t — 12. 


The following theorem presents some cases when the determinant of a matrix A is equal to 
Zero. 


THEOREM 7.7 Let A be a matrix of order п x n. Assume that one of the following 
propositions holds: 


(1) Two rows (columns) are equal. 
(2) All elements of a row (column) of A are equal to zero. 
(3) A row (column) is the sum of multiples of other rows (columns). 


Then |A| = 0. 
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We next introduce the notions of a singular and a regular matrix. 


Definition 7.14 А square matrix A is said to be singular if |A| = 0 and regular (or 
non-singular) if |A| 4 0. 


We now consider a first possibility to solve special systems of linear equations. This approach 
is named after the German mathematician Cramer and uses determinants to find the values 
of the variables. 


Cramer’s rule 


Let 
Апл) ` Xi) = bon) 
be a system of linear equations, i.e. 


a11X1 + a12X2 + +++ + ах = by 


а21х1 + a22X2 + +++ + a2nXn = b2 


ап1Х\ + An2xX2 + +++ + AnnXn = bn, 


and we assume that A is regular (i.e. |A| 4 0). Moreover, let 4;(b) denote the matrix which 
is obtained if the jth column of A is replaced by vector b, i.e. 


а 412 ... a1,j-1 bi a1 j+1 «+. Alyn 
a) an ... G23j-1 b2 ајр ... Gan 
|4;(b)| = : : ; 
An) Am ... dGaj-| bn ајр ... am 
Then 
(4j (b)| А 
ху = | forj = 1,2,...,n 


is the unique solution of the system Ax = b of linear equations. 


Cramer’s rule makes it possible to solve special systems of linear equations. However, this 
rule is appropriate only when the determinant of matrix A is different from zero (and thus 
a unique solution of the system of linear equations exists). It is also a disadvantage of this 
method that, if we obtain |A| = 0, we must stop our computations and we have to apply some 
more general method for solving systems of linear equations, as we will discuss in Chapter 8. 
Moreover, from a practical point of view, Cramer’s rule is applicable only in the case of a 
rather small number of n. 
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Example 7.11 Consider the system of linear equations 
Зх + 4x; + 2x; =1 
X| — x? — 3x3 =7 


2х1 + хз = 4 


which we solve by applying Cramer’s rule. We first evaluate the determinant of matrix 


3 2 
А= |1 -1 -3 
2 1 


3 4 2 
4 2 3 2 3 4 
a= -1 E =з -oh h | 


=2.(—12+2)—0+1-(—3—4) =—27. 


In the above computations, we have decided to expand by row 3 since there is already опе 
zero contained in this row and therefore we have to evaluate only two minors of order two. 
For this reason, one could also choose expansion by column 2. Since |A| 4 0, we know now 
that the given system has a unique solution which can be found by Cramer’s rule. Continuing, 
we get 


14 2 
410) = 17 —1 —3 =4.| 4 je Ej 
4 0 1 
—4.(—12--2) +1: (-1— 28) = —69; 
3 1 2| |0 -20 11 
14200) = |1 7 -3|2| 7 —3 =з E 
2 1| |0 -10 7 


= —1 . (—140 + 110) = 30 
and 


3^ 41 
1 1 
lda(b)- 1 —1 7 =-4-[) je» i 
2 04 


= —4. (4—14) — 1 - (12 — 2) = 30. 


For finding |4 (b)| and |43(b)|, we have used Theorem 7.2. In the former case, we have 
again applied expansion by row 3, and in the latter case, we have applied expansion by 
column 2. For finding |A2(b)|, we have first used Theorem 7.5, part (3). Since there are no 
zero elements, we have transformed the determinant such that in one column or row (in our 
case column 1) all but one elements are equal to zero so that the application of Theorem 7.2 
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is reduced to finding the value of one minor of order two. By Cramer’s rule, we get 


| dbi 69 23 


AUA 27 9’ 

|40 | | 30 10 
д 7 9/ 

143(b)| 30 10 
=“ A 7 9 


7.4 LINEAR MAPPINGS 


Definition 7.15 A mapping A : R” — R” is called linear if 
A(x! + x?) = A(x!) + A(x’) for all x', x? є R” 
and 


А(Ах) = АА(х) for all A є R and x є R”. 


A linear mapping is therefore defined in such a way that the image of the sum of two vectors 
is equal to the (vector) sum of the two images, and the image of the multiple of a vector is 
equal to the multiple of the image of the vector. 


A linear mapping 4 : IR" — IR" can be described by means of a matrix A = (aj) of order 
m x n such that 


X1 yı ai a12 din X1 

X2 P y2 a21 a22 an X2 » 
x= ce R” — y= = | eR 

Xn Ym @т1 Am ^^^ атп Xn 


Definition 7.16 The set of all n-dimensional vectors x which are mapped by А: 
R” — R” into the m-dimensional zero vector 0 is called the kernel of the mapping, 
abbreviated ker A, i.e. 


ker A = Kan em" | A (mn) ` X(n,1) = 0,1). 


The kernel of a linear mapping is also called null space. Determining the kernel of a linear 
mapping requires the solution of a system of linear equations with the components of vector 
x as unknowns, which we will treat in detail in Chapter 8. The following theorem shows how 
a composition of two linear mappings can be described by a matrix. 
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THEOREM 7.8 LetB: R” — R?andA : К° — R” be linear mappings. Then the composite 
mapping 4 o B : IR" — R” is a linear mapping described by matrix 


Conn) = Aims) ` В): 


Example 7.12 Assume that a firm produces by means of q raw materials R1, R2, . . . , Rq the 
m intermediate products 51,.52,... , Sm, and with these intermediate products and with ће q 
raw materials the n final products F1, F5,..., Fn. Denote by 


r; the number of units of raw material R; which are necessary for the production of one 
unit of intermediate product S;, 

S, the number of units of intermediate product S; which are necessary for the production 
of one unit of final product Р, 

rj; Ше number of units of raw material R; which are additionally necessary for the 

production of one unit of final product Ру. 


We introduce the matrices А = (5) of order q х m, SF = (sj ) of order m x n and 
RF = CA ) of order q x n and denote by x” = (хЃ, х?',...,хЁ)Т the production vector of 


the final products and by x? = (x? ‚х9 -— ‚хЎ) the production vector of the intermediate 
products. We want to determine the required vector y of raw materials. First, raw materials 
according to the matrix equation 


1 _ pF F 
Ya, = Кп) ` Xo) 


are required for the final products. Moreover, we get for vector х the following matrix 
equation: 


S F F 
Холу = S(ma) ` X(n)» 


and for the production of intermediate products given by vector х, the required vector y? of 
raw materials is given by 


2 _ ps S _ ps F F 
Ya) = Кт) ` Хтл) = Кт) ` т) ` Xo 


Thus, we get the following relationship between the g-vector y of required raw materials and 
the n-dimensional vector x^: 


1 2 
Ya) = Y) + Yin 
= (RF RB SP) xoay 


i.e. RF + К o S represents a linear mapping from the n-space IR^ into the g-space Ri. This 
linear mapping can be described in the following way: 


x” ERY > (RF + Ro SPX) = RY ex” + RS. SP x =ує В, 


i.e. by this linear mapping a feasible n-dimensional production vector of the final products 
is mapped into a q-dimensional vector of required raw materials. 
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Next, we introduce the inverse mapping of a linear mapping. 


THEOREM 7.9 Let A : R” — R” be a linear mapping described by a matrix A of order 
пхп, 1.е. 


x1 yı 
x2 У2 

x= є R” +> Ах = у = И є В", 
Xn Yn 


yı X1 
y2 X2 

y= . є В" > Ау = х = . є В". 
Уп Xn 


Obviously, the composite mapping A o 47! = AT! o A is the identical mapping J. 


7.5 THE INVERSE MATRIX 


Definition 7.17 Given is a square matrix A. If there exists a matrix A~! such that 
AA | = АЗА = 1, 


then we say that AT! is an inverse or inverse matrix of A. 


We note that the inverse A~! characterizes the inverse mapping of a linear mapping described 
by matrix A. The following theorem answers the question: under which condition does the 
inverse of a matrix A exist? 


THEOREM 7.10 Let A be a matrix of order n x n. Then: 


(1) If matrix A is regular, then there exists a unique inverse matrix A^! . 
(2) If matrix A is singular, then A does not have an inverse. 


If the inverse A~! of matrix A exists, we also say that matrix A is invertible. According to 
Theorem 7.10, a square matrix A is invertible if and only if |A| Z 0. 
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Solving equations by matrix inversion 


Consider the matrix equations AX = B and YA = C, where matrix B of order n x m and 
matrix C of order m x n are given. The matrices X and Y are assumed to be unknown. From 
the above equations, it follows that matrix X has the order n x m and matrix Y has the order 
m x n. For |A| 4 0, the inverse of matrix A exists and we get 


AX=B c X-AB 


YA- C < Y-c4A.. 


The equations on the right-hand side are obtained by multiplying in the former case, equation 
AX = B from the left by A^! and in the latter case, equation YA = C from the right by A^ !. 
Remember that matrix multiplication is not commutative. 


Example 7.13 Let the matrix equation 
АХ = Х(2В — А) + 3(4 + X) 


be given, where A, B and X are n x n matrices. We solve the above equation for matrix X 
and obtain: 


АХ = 2XB — XA РЗА + 3X 
X —2XB + XA = ЗА 
X(I — 2B + A) = ЗА 

X = 34 —2B + A). 


In the second to last step, we have factored out matrix X from the left and we have used 
X = XI. Thus, if the inverse of matrix / — 2B + A exists, matrix X is uniquely determined. 


The following theorem presents a first possibility of computing the inverse of a matrix. 


THEOREM 7.11 Let А be a regular matrix of order n x n. Then the inverse matrix AT! is 
given by 


Fete [Aaa +- Rp 

32 (ena) e ipo fal co eal 
EI EI : ; 

441] +14 ce +|Ann| 


The matrix ((— DY. Ail)" is the transpose of the matrix of the cofactors, which is called 
the adjoint of matrix A and denoted by adj(A). 


To determine the inverse of a matrix A of order n x n, the evaluation of a determinant of 
order n x n and ofn? minors of order n — 1 is required. Thus, with increasing order of n, the 
application of Theorem 7.11 becomes rather time-consuming. 
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Example 7.14 We consider the matrix 
12 -1 

A= 2 1 0], 

-1 0 1 


and we want to determine the inverse 4^! of matrix A. In order to apply Theorem 7.2, we 
first evaluate the determinant of A: 


E ug 1 2 -1 © 
|А\=| 2 1 0|=[2 1 o=- a=- 
c 0 "i5 02 0 


In the above computations, we have first added rows 1 and 3 and then applied cofactor 
expansion by column 3. Calculating the minors, we obtain 


1 0 2 0 1 
Mule y 55 Mmn-|1] ea 11 = |1 d- 5 
2—1 Led 2 
4n = |o le ЖЫК =% Mzi=]|_ |22 
2—1 p 12 
|431| p 1 Д са 1; [432] —, 2 Д = 2; [433] um 1 = —3. 

With the above computations, we get the inverse matrix 

1 [Aul =al 1431] 1-2 1 
4'-qgp|- Mb Mal -Мш}р=—,[-2 0 -2 
[413] —|423| 1433 1 -2 -3 

1 1 1 

424 

1 1 

= = Му = Ge 

2 2 

1 1 3 

42 4 


Example 7.15 Consider the matrix equation 


АХ =X – В 


with 


3 a 
Ae. | 


where a є К. We want to determine X provided that this matrix is uniquely determined. 
Replacing X by IX, where J is the identity matrix, and solving the above matrix equation for 
X, we obtain first 


Су =В 
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and then 
X=(1-A)'B 


with 


—2 -a 
ra (52 5). 
To check whether the inverse of matrix J — A exists, we determine 


—2 -a 


Lio _g| = 10- 10a = 10(1 а). 


1-4) 


For a Æ 1, we have |I — A| 4 0, and thus the inverse of J — A exists and the given matrix X 
is uniquely determined. On the contrary, for a = 1, the inverse of J — A does not exist, and 
thus matrix X is not uniquely determined. We continue and obtain for a Æ 1 


euo 1 —5 а 
елап eio E 


By multiplying matrices (Z — A)~! and B, we finally obtain 


5 + 2a a 
om 1 -5-2a aX | 100-a) 100 —a) 
7 10(1 — a) 14 =>) ME 7 1 


5(1—а) 5(1—а) 


We now summarize some rules for operating with the inverses of matrices, assuming that the 
matrices A and B are of order n x n and that the inverses A^! and B^! exist. 


Rules for calculations with inverses 


(1) 47)! = 4; 
(2) 4D! = Ga); 
(3) (4B)! = В-!4-!; 


(4) 0A! =a (у € RA (0); 


(5) 4 = а 
EU 


We prove the validity of rule (5). Using |/| — 1 and Theorem 7.6, we obtain 
1= || = 14471 = 141-1471] 

from which rule (5) follows. As a generalization of rule (3), we obtain 
(4) 1 = (47)"  foralineN. 


We have seen that, if the order of the matrix is large, the determination of the inverse of 
the matrix can be rather time-consuming. In some cases, it is possible to apply an easier 
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approach. The following theorem treats such a case, which occurs (as we discuss later) in 
several economic applications. 


THEOREM 7.12 Let C bea triangular matrix of order n x n with cj; = 0 fori = 1,2,...,n 
and A = I — C, where Г is the identity matrix of order n x n. Then the inverse 4^! is given by 


J SCS) =1+С+С?+ +С". 


The advantage of the formula presented in Theorem 7.12 over the formula given in 
Theorem 7.11 is that the determination ofthe inverse is done without using determinants. The 
latter formula uses only matrix multiplication and addition, and the matrices to be considered 
contain a lot of zeroes as elements. 


7.6 AN ECONOMIC APPLICATION: INPUT-OUTPUT MODEL 


We finish this chapter with an important application of matrices in economics. Assume that 
we have a set of n firms each of them producing one good only. Production of each good 
j requires an input of aj units of good i per unit of good j produced. (The coefficients aj 
are also known as input-output coefficients.) Production takes place with fixed techniques 


(i.e. the values aj; do not change). Let x = (х1,х2,... , X5). be the vector giving the total 
amount of goods produced, let matrix A = (aj) of order n x n be the so-called technology or 
input-output matrix and let y = (y1,y2, ..., Yn)! be the demand vector for the use of the n 


goods. Considering the ith good, there are ajjx; units required as input for the production of 
xj units of good j, and y; units are required as final customer demand. Therefore, the amount 
x; of good i has to satisfy the equation 


Xi = апхі + арх +--+ + dinXn Бу, i= 1,2,...,п. 
Expressing the latter п equations in matrix notation, we get the equation 
x=Ax+y 
which can be rewritten as 
Ix — Ax = (I — А)х = y, 
where J is the identity matrix of order n x n. The above model expresses that vector x, giving 
the total output of goods produced, is equal to the sum of vector Ax describing the internal 
consumption of the goods and vector y representing the customer demand. The model is 
referred to as an input-output or Leontief model. The customer demand vector y is in general 
different from the zero vector, and in this case we have an open Leontief model. The equation 
@—4А)х=у 
represents a linear mapping К” — К” described by matrix J — A. If the total possible output x 


is known, we are interested in getting the possible amount y of goods left for the customer. 
Conversely, a customer demand vector y can be given and we ask for the total output vector x 
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required to satisfy the customer demand. In the latter case, we have to consider the inverse 
mapping 

x= (1 А) ly. 


So we have to determine the inverse of matrix J — A to answer this question in the latter case. 


Example 7.16 Consider a numerical example for an open input—output model. Let matrix 
A and vector y be given as follows: 


1 1 0 
5 5 2 
A= AR Ne and у= | 0 
5 5 5 1 
3 2 1 
5 5 5 
Then we get 
4 1 0 
3 1 4 —1 0 
р SE = _~_|=-—]-2 2 —1 
5 5 5 -3 —2 4 
3 2 4 
5 5 5 
Setting 
4 -1 0 
B-|-2 2 —1|, 
—3 -2 4 


we have / — A = B/S. Instead of inverting matrix J — A, we invert matrix B, which has only 
integers, and finally take into account that matrix equation (J — A)~! = 5 - B^! holds. First, 
we obtain |B| = 13 and thus the inverse of matrix B exists. Applying Theorem 7.11, we get 


,(6 41 
Bi--—[n 16 4 
33110 11 6 
Then we get 
;í(9 41 
d—A)!=5B'=—]11 16 4 
13 10: 11 6 


х= — |11 16 4 0 | = 26 | = | 10 
26 10 
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Figure 7.2 Relationships between raw materials and products in Example 7.17. 


Next, we consider a second example of this type of problem. 


Example 7.17 А firm produces by means of four raw materials А, R?, Кз and R4 five 
products P1, P5, Рз, Рд and P5, where some of these products are also used as intermediate 
products. The relationships are given in the graph presented in Figure 7.2. The numbers 
beside the arrows describe how many units of raw material Ry and product P;, respectively, 
are necessary for one unit of Pj,j = 1,2,...,5. Vector x = (X1,X2,X3,X4, X5)! describes 
the produced units (total output) of product P; and y = (y1,y2,y3, уд, ys)! denotes the final 
demand (export) for the output of the products P;. 


(1) We first determine a relationship between vectors x and y. Let the technology matrix 


01 00 1 
000 12 
4=|0 002 0 
0000 1 
000 0 0 
be given. Then 
x=Ax+y 


or, correspondingly, 


(1 – А)х = у. 
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In detail, we have 


1 -1 0 0 —1 ХІ yı 
0 10-1 -2 x2 y2 
0 0 1 -2 0 X3 = | уз 
0 0 0 1 -1 х4 y4 
0 0 0 0 1 X5 ys 


(2 We now calculate the final demand when the total output is given by 
x = (220, 110, 120, 40, 20)T. We obtain 


1 —1 0 -1 220 90 
0 10-1 -2 110 30 
y=(I1—A)x =] 0 0 1 -2 0 120 |=] 40 
0 0 0 1 -1 40 20 
0 0 0 0 1 20 20 
(3) Let y = (60, 30, 40, 10,20)" be the given final demand vector. We determine the pro- 


duction vector x and the required units of raw materials for this case. To this end, we 
need the inverse of matrix J — A. Since J — A is an upper triangular matrix with all 
diagonal elements equal to zero, we can apply Theorem 7.12 for determining (I — 4)! , 
and we obtain 


@— 4А)! = 1+ А-+ А? + А? + А°. 


Using 
000 12 0 0 00 1 
0.0 00 1 00000 
42|[00002], 4=]0 0000 
00000 00000 
00000 00000 
we get that 4^ is the zero matrix and therefore 
1 10 14 
0 1.0 1 3 
qQ-45!2|00 122 
0.0 0 1 1 
0 0 00 1 
Then we get 
1 10 14 60 180 
0 10 1 3 30 100 
х=(1-4)!у=|0 0122 40 |= | 100 
0.00 1 1 10 30 
0 0 0 0 1 20 20 
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For the required vector of raw materials г = (r1, ғо, r3, r4). , where r; denotes the quantity 
of required units of raw material R;, we obtain from Figure 7.2: 


ry = 4x 
n = 2x, + + 3x5 
r3 = 2x2 + x3 + 3x4 
YA = 2x3 
i.e. 
п 4000 0\( 7 
x2 
r2 2 0 0 0 3 
r= = X3 
r3 02 1 3 0 > 
r4 00200 E 
5 


For the determined vector x = (180, 100, 100, 30, 20)", we get the following vector r: 


т 720 
|е |_| 420 
з | | 390 
r4 200 
EXERCISES 


7.1 


7.2 


7.3 


Given are the matrices 


2 1 
= (0 i | B=| 1 2); 
-] 0 


1 0 -1 3 1 0 
бе, 1 s and D=, 2 3 
(a) Find the transposes. Check whether some matrices are equal. 


(b) Calculate A + D, A — D, AT — B and C — D. 
(c) Find A + 3(87 — 2D). 


Find a symmetric and an antisymmetric matrix so that their sum is equal to 
2 -1 0 
А= |1 1 -1 
3 0 -2 


4 2 еі 
e 4-( 3): a 6 3 


с 
— 
a 
ll 
i IN 
N Б 
чл w 
O U 
— UJ 
UBER. 
сз 
ll 
wp Re 
BRON WN 
сл МӘ ЧЧ) о 
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—3 
6 
(с) 4 = (2 3 4 5): В = E Н 
2 
2 3 
2 —1 3 
4 2 6 1 2 
x 
2 3 1 
e 4-(i 5 3 ac : 


7.4 Use the matrices given in Exercise 7.3 (d) and verify the equalities 
AT BT = (BA)! and ВТАТ = (АВ). 


7.5 Given are the matrices 
1 —1 
2 —2 0 1 0 
А= |7 —4|; В = 0 and | 030 a 
2 7 


(a) Find the dimension of a product of all three matrices if possible. 
(b) Test the associative law of multiplication with the given matrices. 


7.6 Calculate all powers of the following matrices: 


02 8 1 

007 3 cosa sino 
@) 4= ооо 5P (b) Per ed 

0000 


7.7 A firm produces by means of two raw materials № and R» three intermediate products 
$1, S2 and $5, and with these intermediate products two final products Ру and F5. The 
numbers of units of Ri and R3 necessary for the production of 1 unit of Sj, є {1, 2, 3}, 
and the numbers of units of S1, 52 and $5 necessary to produce 1 unit of F and Fo, 
are given in the following tables: 


кә: 5 25 |6 0 
R,| 2 3 5 P a 
R3|l5 4 1 5з T 


Solve the following problems by means of matrix operations. 


(a) How many raw materials are required when 1,000 units of F; and 2,000 units of 
Е) have to be produced? 

(b) The costs for one unit of raw material are 3 EUR for Rı and 5 EUR for R2. 
Calculate the costs for intermediate and final products. 
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7.8 Given is the matrix 


7.9 


7.12 


2.3 0 
A=]|-1 2 4 
05 1 


(a) Find the submatrices 412 and 422. 

(b) Calculate the minors |412| and |A29|. 

(c) Calculate the cofactors of the elements a11, a21, a31 of matrix A. 
(d) Evaluate the determinant of matrix A. 


Evaluate the following determinants: 


1200 1012 
2 1 6 

212 0 12 34 

00 2 1 2.140 
2 7 4 1 —1 2 4 3 н: a 
3 1 4 0 2 —4 -8 ME. NE 
(d) 5100 (e) 7 1 5 ME (f)3 3 0 3 3 
2 0 0 0 1 5 0 1 333 3 0 


(n,n) 
Find the solutions x of the following equations: 


—1 x x x 1 2 
(а) | 2 —1 2| = 27; (0) |3 x —l)=2. 
2 2 —1 4 x —2 


Find the solution of the following system of equations by Cramer’s rule: 


2x, +4x2 +3x3 = 1 
3x, —6x; —2x3 = -2. 
—5x, +8x2 +2x3 = 4 


Let А: R? — R? be a linear mapping described by matrix 
3 10 
А=|—1 2 4 
4 1 5 


Find the kernel of this mapping. 


Given are three linear mappings described by the following systems of equations: 


и] 


ll 
е 
= 
| 
е 
N 
+ 
c 
125] 


uj) = 20р =v —03 
из = =p +02 +203, 
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7.14 


7.15 


7.16 


7.17 


y= —W| +w3 
уу = wi +2w2 —W3 
yy- = w2 —2wa, 
w = х] —х2 —x3 
w = -—x| —2x2 +3x3 
из = 2x +23. 


Find ће composite mapping x є R? +> ue №. 
Find the inverse of each of the following matrices: 


1 0 3 
а) 4= [4 1 2|; 
011 
Quem» cq 
(b) B-|[3 4 -2]; 
5 1 =l 
] Ux 23 
© с=|0 2 4]; 
0 0 —1 
1 0 -1 2 
D wa ous 3 
© D=] „ 2 4 
0 1 2.45 
Let 
1 2 -1 0 4 
01 2 3 0 
S210 0 1 3 -1 
00 0 1 
00 0 0 1 


Find the inverse matrix by means of equality A = J — C. 
Given are the matrices 


—2 5 1 4 
4-( 1 3} and s-( 3 J 


Find (4B) ! and B | 4^ !. 
Given are the following matrix equations: 


(a) (XA) = B; (b XA=B-2X; (c) AXB=C; 
(d A(XB)!-2C; (e) C'xA-- (Tc)! = І – ЗСЇХ. 


Find matrix Х. 
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7.18 Given is an open input—output model (Leontief model) with 


0 02 01 03 
0 0 02 0.5 
m 0 0 0 0 
004 0 0 


Let x be the total vector of goods produced and y the final demand vector. 


(a) Explain the economic meaning of the elements of A. 

(b) Find the linear mapping which maps all the vectors x into the set of all final 
demand vectors y. 

(c) Is a vector 


100 
200 
200 
400 


of goods produced possible for some final demand vector y? 
(d) Find the inverse mapping of that obtained in (b) and interpret it economically. 


7.19 А firm produces by means of three factors R4, R? and Ёз five products P1, P5, . . ., P5, 
where some of these products are also used as intermediate products. The relationships 
are given in the graph presented in Figure 7.3. The numbers beside the arrows describe 
how many units of А; respectively P; are necessary for one unit of Pj. Let p; denote 


Ry Rə Ёз 
| (iene i ee? E 
| 1i | 
| hof 
a Bl | 
3 P2 | 
| | 
| pa 2 | 
| ja MES 
| P4 ERA c 
| Р | | 
| 2 | 
c ы] 

"93 "Ф "94 * q2 q5 


Figure 7.3 Relationships between raw materials and products in Exercise 7.19. 
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the produced units (output) of P; and g; denote the final demand for the output of 
Pii € {1,2,..., 5}. 


(a) Find a linear mapping p € R$ > qe RŽ. 

(b) Find the inverse mapping. 

(c) Let r? = (r1,r2,r3) be the vector which contains the required units of the 
factors А, Ro, Кз. Find a linear mapping q + г. Calculate r when q = 
(50, 40, 30, 20, 10)T. 


8 Linear equations and inequalities 


Many problems in economics can be modelled as a system of linear equations or a system 
of linear inequalities. In this chapter, we consider some basic properties of such systems and 
discuss general solution procedures. 


8.1 SYSTEMS OF LINEAR EQUATIONS 


8.1.1 Preliminaries 


At several points in previous chapters we have been confronted with systems of linear 
equations. For instance, deciding whether a set of given vectors is linearly dependent or 
linearly independent can be answered via the solution of such a system. The following 
example of determining feasible production programmes also leads to a system of linear 
equations. 


Example 8.1 Assume thata firm uses three raw materials R1, R2 and R3 for the production of 
four goods G1, G5, G3 and G4. There are the following amounts of raw materials available: 
120 units of А, 150 units of R2 and 180 units of R3. Table 8.1 gives the raw material 
requirements per unit of each good. Denote by x; the quantity of good G;, i € {1, 2,3,4}. We 
are interested in all possible production programmes which fully use the available amounts 
of the raw materials. 

Considering raw material Ку we get the following equation: 


1х1 + 2x2 + 1x3 + 3x4 = 120. 


Table 8.1 Raw material requirements for the goods G;, i € {1,2,3,4} 


Raw material Goods 

Gi G2 G3 G4 
Ry 1 2 1 3 
R5 2 0 3 1 
R3 1 4 2 4 
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Here 1x, is the amount of raw material А necessary for the production of good Сі, 2x2 the 
amount of А for good G2, 1x3 the amount of R; for good G3 and 3x4 is the amount of А 
required for the production of good Сд. Since all 120 units of raw material А should be 
used, we have an equation. Similarly we obtain an equation for the consumption of the other 
two raw materials. Thus, we get the following system of three linear equations with four 
variables: 


xy + 2x. + x3 + 3x4 = 120 
2х1 + 3x3 + x4 = 150 
xy + 4x. + 2x3 + 4x4 = 180 


Moreover, we are interested only in solutions for which all values x;,i € {1,2,3,4}, are 
non-negative. Considering e.g. the production programme x, = 40, x; = 15, x3 = 20 and 
x4 = 10, we can easily check that all equations are satisfied, i.e. this production programme 
is feasible, and there exists at least one solution of this system of linear equations. In order 
to describe all feasible production programmes, we have to find all solutions satisfying the 
above equations such that all values x;, i € {1,2,3,4}, are non-negative. 

Often, it is not desired that all raw materials should necessarily be fully used. In the 
latter case it is required only that for each raw material R; the available amount is not 
exceeded. Then all equality signs in the above equations have to be replaced by an inequality 
sign of the form <, and we obtain a system of linear inequalities which is discussed in 
Chapter 8.2. 


In the following, we discuss general methods for solving systems of linear equations. 
We answer the question of whether a system has a solution, whether an existing solution 
is uniquely determined and how the set of all solutions can be determined in the general 
case. 


Definition 8.1 The system 


a11X1 + a12X2 +--+ + йыл = bi 
а21х1 + a22X2 +--+ + aanxXn = b2 


(8.1) 

ат1х1 + dg2X2 + +++ + атпХп = bm 
is called a system of linear equations, where x1,x2,. .., x, are the unknowns or vari- 
ables, a11, 412, . . - , agg are the coefficients and bj, b», . . . , bm are called the right-hand 


sides. 


As an abbreviation, system (8.1) can also be written in matrix representation: 


Ax — b, 
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where the left-hand side corresponds to the multiplication of a matrix of order m x n by a 
vector (matrix of order n x 1) resulting in the m-vector b, i.e. 


ап ар © ап xl by 
a2) Aan © An x2 by 
Gm] Am “`` ат Xn bn 


Analogously, we can write a system of linear equations in vector representation as follows: 


n 
Ух) а/ = Ь, 
j=l. 


where 
aij 
| ar 
al = . + J = 1,2, ‚п, 
ат] 


are the column vectors of matrix А. The above left-hand side represents a linear combination 
of the column vectors of matrix A with the values of the variables as scalars, i.e.: 


ai 412 Ain by 

421 422 a2n b2 
xl ; + x2 Too Xp А = 

Om] m2 атп bm 


Next, we introduce some basic notions. 


Definition 8.2 If we have b; = 0, i = 1,2,...,m, in system (8.1), then this system 
is called homogeneous. If we have by #4 0 for at least опе k є {1,2,...,m}, then 
system (8.1) is called non-homogeneous. 


Definition 8.3 A vector x = (х1,х2,... ОУ which satisfies Ax = b is called а 
solution of system (8.1). The set 


S = (x eR" | Ax = b) 


is called the set of solutions or the general solution of system (8.1). 
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8.1.2 Existence and uniqueness of a solution 


Next, we investigate in which cases system (8.1) has a solution. To this end, we introduce 
the notion of the rank of a matrix. For this purpose, the following property is useful. 


THEOREM 8.1 Теѓ А be a matrix of order m х n. Then the maximum number of linearly 
independent column vectors of A coincides with the maximum number oflinearly independent 
row vectors of A. 


Definition 8.4 Let 4 bea matrix of order m x n. The rank of matrix A, written r (A), is 
the maximum number p of linearly independent column (or according to Theorem 8.1 
equivalently, row) vectors in A. If A is any zero matrix, we set r(A) = 0. 


As an obvious consequence of Definition 8.4, we obtain r(4) = p < min(m,n). The 
following theorem gives a first criterion to determine the rank of a matrix A. 


THEOREM 8.2. The rank r(A) of matrix A is equal to the order of the largest minor of A that 
is different from zero. 


We recall that a minor of a matrix 4 was defined as a determinant of a square submatrix of A. 
The above criterion can be used when transforming the determinant of a matrix A in such 
a way that the order of the largest minor can be easily obtained. Otherwise, it can be rather 
time-consuming to determine the rank of a matrix by applying Theorem 8.2. Consider the 
following two examples. 


Example 8.2 Let 
120 
A=|4 6 2 
3 2 4 


We obtain |A| = 0 which means that matrix A cannot have rank three. However, e.g. for the 
minor obtained from matrix A by deleting the last row and column, we get 


1 2 
[433] a А 6 |= —2, 


ie. there is a minor of order two which is different from zero, and thus matrix A has 
rank two. 


Example 8.3 Let us consider the matrix 


1-х -1 1 
А = 1 1-х 3 
1 0 1 
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with x € IR. We determine the rank of matrix A in dependence on the value of x € R. 
Expanding |A| by the third row, we get 


-1 


1 
а=, 3 


ч 


1 1—х 
=—3-@-%+]@-%@-+1] 


= (—4 + x) + (x? —2x + 2) 
—x-x-2. 

We determine the roots of the equation x? — x — 2 = 0 and obtain x; = —1 and x; = 2. 
Thus, we get |4| Æ 0 for the case x zz —1 and x Æ 2, i.e. due to Theorem 8.2, we have 
r(A) = 3 for x € R \ {-1,2}. For x є (—1,2], we obtain for the minor formed by rows 1 
and 3 as well as columns 2 and 3 


—1 1 
Thus, we obtain r(4) — 2 for the case x — —1 and for the case x — 2 because the order of 


the largest minor with a value different from zero is two. 


Definition 8.5 We define the m x (n + 1) augmented matrix 


ап ар c Ain | by 

а22 Ere An b2 

Ap — (4 |b) = 1 
а am a Ám bm 


as the coefficient matrix A expanded by an additional column containing vector b of 
the right-hand side. 


Obviously, we have r(4) < r(A») since matrix A, contains an additional column vector in 
comparison with matrix A. Moreover, since the augmented matrix differs from the coefficient 
matrix A by exactly one column, there are only two cases possible: either r(45) = r(A) or 
r(Ap) = r(4) + 1. 


Definition 8.6 If system (8.1) has at least one solution, it is said to be consistent. If 
this system has no solution, it is said to be inconsistent. 


Next, we present a necessary and sufficient condition for the case where a system of linear 
equations has at least one solution. 
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THEOREM 8.3 System (8.1) is consistent if and only if the rank of the coefficient matrix A 
is equal to the rank of the augmented matrix 45 = (A | b), i.e. 


system Ax = b is consistent <= r(A) = r(Ay). 


Since for a homogeneous system of equations, the augmented matrix contains matrix A plus an 
additional zero (column) vector, the number of linearly independent (column or row) vectors 
of the augmented matrix is always equal to the number of linearly independent vectors of 
matrix A. This leads to the following corollary. 


COROLLARY 8.1 A homogeneous system Ax = 0 is always consistent. 


Indeed, for system Ax = 0 we have r(A) = r(Ap). We can note that a homogeneous system 
has at least the so-called trivial solution x! = (x1,x2,...,Xn) = (0,0,...,0). 


Next, we deal with the following question. If system (8.1) is consistent, when is the solution 
uniquely determined? An answer is given by the following theorem. 


THEOREM 8.4 Consider the system Ax = b of linear equations, where А is a matrix of 
order m x n, and let this system be consistent. Then: 


(1) If r(4) = (Аһ) = n, then solution x = (x1,x2, ..., Xn)! is uniquely determined. 
(2) If r(4) = r(A) = p < n, then there exist infinitely many solutions. In this case, the set 
of solutions forms an (n — p)-dimensional vector space. 


Incase (2) of Theorem 8.4 we say that the set of solutions has dimension n — p. Let us consider 
part (2) of Theorem 8.4 in a bit more detail. In this case, we can select n — p variables that can 
be chosen freely. Having their values fixed, the remaining variables are uniquely determined. 
We denote the n — p arbitrarily chosen variables as free variables, and we say that the system 
of linear equations has n — p degrees of freedom. 


8.1.5 Elementary transformation; solution procedures 


Solution procedures for systems of linear equations transform the given system into a ‘system 
with easier structure'. The following theorem characterizes some transformations of a given 
system of linear equations such that the set of solutions does not change. 


THEOREM 8.5 The set of solutions of system (8.1) does not change if one of the following 
transformations is applied: 


(1) An equation is multiplied by a number А Z 0 or it is divided by a number A Æ 0. 
(2) Two equations are interchanged. 
(3) A multiple of one equation is added to another equation. 


Operations (1) to (3) are called elementary or equivalent transformations. By such elementary 
transformations, the rank r(A) of a matrix A does not change either (see also rules for 
evaluating determinants given in Chapter 7.3). Finally, we introduce a special form of a 
system of linear equations and a solution as follows. 
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Definition 8.7 Asystem Ax = b ofp = r(A) linear equations, where in each equation 
one variable occurs only in this equation and it has the coefficient +1, is called a system 
of linear equations in canonical form. These eliminated variables are called basic 
variables (bv), while the remaining variables are called non-basic variables (nbv). 


Hence the number of basic variables of a system of linear equations in canonical form is 
equal to the rank of matrix A. As a consequence of Definition 8.7, if a system of linear 
equations Ax = b is given in canonical form, the coefficient matrix A always contains an 
identity matrix. If r(4) = p = n, the identity matrix J is of order n x n, i.e. the system has the 
form 


IxP — b, 


where xP is the vector ofthe basic variables. (Note that columns might have been interchanged 
in matrix A to get the identity matrix, which means that the order of the variables in vector 
xP is different from that in vector x.) If r(4) = p < n, the order of the identity submatrix is 
p х p. In the latter case, the system can be written as 


Ix? + AyxN = b, 


where x® is the p-vector of the basic variables, xN is the (n — p)-vector of the non-basic 
variables and Ay is the submatrix of A formed by the column vectors belonging to the non- 
basic variables. (Again column interchanges in matrix A might have been applied.) This 
canonical form, from which the general solution can easily be derived, is used in one of the 
solution procedures described in this subsection. 


Definition 8.8 A solution x of a system of equations Ax = b in canonical form, 
where each non-basic variable has the value zero, is called a basic solution. 


Thus, if matrix A is of order p x n with r(A) = p < n, then at least n — p variables are equal 
to zero in a basic solution of the system Ax = b. The number of possible basic solutions of 
a given system of linear equations is determined by the number of different possibilities of 
choosing p basic variables. That is, one has to find among the column vectors of matrix A all 
possibilities of p linearly independent vectors belonging to the p basic variables. There exist 
at most () basic solutions (see Chapter 1.3 on Combinatorics). 


One method of solving systems of linear equations has already been discussed in Chapter 7.3, 
but remember that Cramer’s rule is applicable only in special cases. The usual methods of 
solving systems of linear equations apply elementary transformations mentioned in Theo- 
rem 8.5 to transform the given system into a form from which the solution can be easily 
obtained. The methods typically used transform the original system into either 


(1) a canonical form according to Definition 8.7 (pivoting procedure or Gauss-Jordan 
elimination) or into 
(2) a ‘triangular’ or echelon form (Gaussian elimination). 
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It is worth noting that the notation for both procedures in the literature is not used in a 
standard way; in particular Gaussian elimination is often also denoted as pivoting. The 
reason is that both procedures are variants of the same strategy: simplify the given system of 
linear equations in such a way that the solution can be easily obtained from the final form of 
the system. We now discuss both methods in detail. 


Pivoting 


First, we discuss the pivoting procedure. The transformation of the original system into a 
canonical form (possibly including less than m equations) is based on the following theorem 
and the remark given below. 


THEOREM 8.6 Let Ax = b be a given system of m linear equations with n variables and 
r(A) = p < min{m,n}. Then the augmented matrix 45 = (A | Б) can be transformed by 
applying Theorem 8.5 and column interchanges into the form 


10... 0 aÏ p41 . ау„| Ь] 
0 1 ... 0 a p41 vp а„| Ь 
x * * 
(4* |b) = : : m : Eu “om К, (82) 
" A ut 
0 0 о о. 0| 0 
фе Oe HD. Gene К. 


with bu = 0or bu z 0. 


It is easy to see that the matrix A* given in (8.2) (and therefore also the original coefficient 
matrix A) have rank p. In terms of Theorem 8.2, this means that matrix A* has a minor of 
order p whose value is different from zero. This can easily be seen by taking the identity 
submatrix obtained in the left upper part (printed in bold face) whose determinant is equal to 
one. However, there is no minor of a larger order than p whose value is different from zero. 
(If we add one row and one column, the value of the determinant is equal to zero since there is 
one row containing only zero entries.) Notice also that the first p rows in representation (8.2) 
describe a system of linear equations in canonical form. 


Remark Inthe case when (А) = p is not smaller than min(m, n}, we can transform matrix 
A into one of the three following forms (A* | b*): 


(1) If m < n, then 


10... 0 d nil ә. adfa | b7 
01... 0 а... а | b 

(4* | P) = E : E: Е 7 i (8.3) 
0 0 1 a* at, | b* 
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(2) Ifm > n, then 


1 0 0 bi 
0 1 0 ЬУ 
0 0 1 b% 

(A* | 5 = a (8.4) 
0 0 0 | Br. 
0 0 0; 0 
оо... 0|] 0 

with Ьу = 0 or b5; 3 0. 
(3) Ifm = n, then 

1 0 0 bi 
0 1 ... 0) 5 

(A* [5% = |. . а S (8.5) 
0 0 1| 5, 


Each ofthe matrices in (8.2) to (8.5) contains an identity submatrix. The order of this identity 
matrix is the rank of the coefficient matrix A originally given. In the transformed matrix 
A*, each column corresponds to some variable. Since column interchanges were allowed, 
we simply denote the variable belonging to the first column as the first basic variable хві, 
the variable belonging to the second column as the second basic variable xp? and so on. 


(If no column interchanges were applied, we have the natural numbering x1,x2, . . . , xp.) 
Accordingly, we denote the variables belonging to the columns which do not form the identity 
submatrix as the non-basic variables xy1,xN2,..-,XN,n—p- 


We now discuss how the solutions can be found from the system in canonical form given by 
the first rows of matrices (8.2) to (8.5) (including the identity matrix). 


In the cases defined by matrices (8.3) and (8.5), there always exists a solution (r(4) — 
r(Ap) — m). In particular, in the case of matrix (8.5) we have the unique solution 


* * * 
XB1 = Бү, xg) = b>, Dry XBn = b,. 


(Note that variables might have been interchanged.) In the case of matrix (8.3), we have 
n — m degrees of freedom, i.e. n — m variables belonging to columns т + 1,m + 2,...,n 
can be chosen arbitrarily. 


In the case of matrix (8.4), there exists a unique solution 
хві = bj, xg) = $, dis XBn = y, 


provided that b% 41 = 9, otherwise the system has no solution. In the case of p < min{m, n] 
considered in Theorem 8.6, the system is consistent if and only if b +1 = 0, and the system 
of the first p equations obtained by the transformation of matrix (A | b) into (4* | b*) isa 
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system in canonical form which can be written as 


хві + арыт coc + архулу =b] 
* * EE 

ХВ2 + yyy NL + © + G.XNa-p = 65 

Xgp + 45 541XN1 nb See cyst арһ®М л—р = b. 


Since r(4) = r(A*) = p, there are n — p degrees of freedom, i.e. n — p variables can be 
chosen arbitrarily. We can rewrite the system in canonical form in terms of the basic variables 
as follows: 


E * * = aie 25, ж 
хві = bj — 41 p+1%N1 Qs XN ,n—p 
хво bj — а5 p41¥N1 — ++ — GQNa-p 
= * * - * 
XBp — b, — аруым: — i арпХ№ л-р 


хм arbitrary for j = 1,2,...,n — p. 


We emphasize that, if хві, хв2,...,хвр are the basic variables, this means that the column 
vectors of matrix A belonging to these variables are linearly independent and form a basis of 
the vector space spanned by the corresponding p column vectors of matrix A. If we choose 
хм = 0,7 = 1,2,...,n — p, we obtain the basic solution 


ж ж ж 
хві = bj, xg2 = 5, icu XBp — by. 


Next, we discuss how we can transform the augmented matrix A, = (A | Б) into matrix 
(A* | b*) by elementary transformations in a systematic way from which the canonical form 
can be established. 


These formulas correspond to those which have been presented in Chapter 6.4 when consid- 
ering the replacement of one vector in a basis by another vector of a vector space. Assume 
that variable x; should become the basic variable in the kth equation of the system. This can 
be done only if aj; 4 0. The element ay is called the pivot or pivot element. Accordingly, 
row Ё is called the pivot row, and column / is called the pivot column. The transformation 
formulas distinguish between the pivot row and all remaining rows. They are as follows. 


Transformation formulas 


(1) Pivot row k: 


_ ag 

ag = —, j=1,2,...,n 
ak] 

by = — 
ak] 


(2) Remaining rows i = 1,2,...,т, i Æ k: 
= ай = : 
aj —aj — —-ag = ау — арац, jf = 1,2,...,п 
ak] 
ns aij fet 
bj = bi — — - b = bi — by: ai 
ak] 
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These transformations can be performed by using the following tableaus, where rows | to m 
in the first tableau give the initial system, and rows m + 1 up to 2m in the second tableau give 
the system of equations after the transformation. The column bv indicates the basic variables 
in the corresponding rows (1.e. after the pivoting transformation, x; is basic variable in the 
kth equation of the system, while we do not in general have basic variables in the other 
equations yet). The last column describes the operation which has to be done in order to get 
the corresponding row. 


Row bv x X2 < xX с Xn b Operation 
1 — ар ару cs ац oc an b 
— ау an v: ay ss am b, 
k — а ар c5 ац co а bk 
m — Amy am `’ аш coo dmn bm 
m+1 — ayy 12 «°° 0 tee |n bi row 1 — 212 row k 
m+2 — 221 а) С 0 DE аһ by row 2 — 221 row k 
m+k Xp ад аю co 1 DE ањ by a row k 
2m — а üm co 0 o m bm rowm— zu row k 


Here all elements in the pivot column (except the pivot element) are equal to zero in the new 
tableau. After the transformation, we have the basic variable x; in row m + k which occurs 
with coefficient +1 now only in equation k of the system and the transformed equation can 
be found now in row m 4- k of the second tableau. To illustrate the computations in all rows 
except the pivot row, consider the determination of element 222. We have to consider four 
elements: the number a22 on the original position, the element аз; of the second row standing 
in the pivot column, the element арз of the pivot row standing in the second column and 
the pivot element ax. (The corresponding elements are printed in bold face in the tableau.) 
These elements form a rectangle, and therefore the rule for computing the values in all rows 
except the pivot row in the new scheme is also known as the rectangle rule. 


Example 8.4 Let us consider the following system of linear equations: 


X| + 3x3 — 2x4 =5 
3x + x + 4x + x4 =5 
4x + x + 73 + x, =10 
2x, + x) + x3 + 3x4 =0 


Applying the pivoting procedure, we get the following sequence of tableaus. (Hereafter, the 
pivot element is always printed in bold face, and the last column is added in order to describe 
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the operation that yields the corresponding row): 


Row bv xy x? x4 x4 b Operation 
1 - 1 0 3 -2 5 
2 - 3 1 4 1 5 
3 - 4 1 7 1 10 
4 = 2 1 1 3 0 
5 x] 1 0 3 -2 5 rowl 
6 - 0 1 -5 7 —10 row2-—3rowl 
7 - 0 1 —5 9 —10 row3-—4rowl 
8 — 0 1 —5 7 —10 row4—2rowl 
9 ХІ 1 0 3 —2 5 row 5 
10 x» 0 1 —5 7 —10 row6 
11 - 0 0 0 2 0 row7-— row6 
12 — 0 0 0 0 0 row8— row6 
13 xy 1 0 3 0 5 row9+ row 11 
4 x 0 1 -5 0 -10 rowl0— row ll 
5 x4 0 0 0 1] 0 юми 
16 — 0 0 0 0 0 row 12 


We avoid the interchange of the column ofx3 with the column of x4 to get formally the structure 
given in Theorem 8.6. In the first two pivoting steps, we have always chosen element one as 
the pivot element, which ensures that all above tableaus contain only integers. In the second 
tableau (i.e. rows 5 up to 8), we could already delete one row since rows 6 and 8 are identical 
(therefore, these row vectors are linearly dependent). From the last tableau we also see that the 
rank of the coefficient matrix is equal to three (since we have a 3 x 3 identity submatrix, which 
means that the largest minor with a value different from zero is of order three). Since x), x2 
and хд are basic variables, we have found that the column vectors belonging to these variables 
(i.e. the first, second and fourth columns of A) are linearly independent and constitute a basis 
of the space generated by the column vectors of matrix A. From rows 13 to 16, we can rewrite 
the system in terms of the basic variables: 


X1 = 5 — 3x3 
x2 = —10 + 5x3 
ха = 0 


(хз arbitrary). 
Setting now x3 = t witht € К, we get the following set of solutions of the considered system: 
xı =5-3t; x2 = —10 + 5t; x3 = 0; x4 = 0; teR. 


Since we know that one variable can be chosen arbitrarily, we have selected the non-basic vari- 
able x3 as the free variable. From the last tableau we see that we could not choose x4 as the free 
variable since it must have value zero in any solution of the given system of linear equations. 
However, we can easily see from the transformed system of linear equations, whether one or 
several variables are uniquely determined so that they cannot be taken as free variables. 
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In the latter example, there was (due to n — p = 1) one variable that can be chosen arbitrarily. 
In this case, we also say that we have a one-parametric set of solutions. 


Gaussian elimination 


Next, we discuss Gaussian elimination. This procedure is based on the following theorem. 


THEOREM 8.7 Let Ax = b be a given system of m linear equations with n variables and 
r(A) = p < min{m,n}. Then the augmented matrix Ap = (A | Б) can be transformed by 
applying Theorem 8.5 and column interchanges into the form 


a aj ... a, d p41 TL E 

* * * * 
0 ay ... а ау © On 2 

a^ | bY) = 0 0 232 a, 2541 "T арп b, (8.6) 
0 0 ... 0 0 e. 0 PE, 
0 0 0 0 axe 00 0 
0 OJ uns 0 0 sex 0 0 
with aj, «455 -...: a5, # 0 and bj, , = 0 or by, £0. 


In terms of Theorem 8.2, the transformed matrix A* in Theorem 8.7 (and the original matrix 
A too) possesses a minor of order p whose value is different from zero. This is the minor 
formed by the first p rows and columns in representation (8.6). Since all diagonal elements 
are different from zero, the value of the determinant is equal to the product of these diagonal 
elements (see Theorem 7.4). However, matrix A* (and matrix A too) does not have a minor of 
order p + 1 which is different from zero (in each minor of matrix A*, there would be one row 
containing only zero entries, and by Theorem 7.7 in Chapter 7.3, this determinant is equal to 
zero). 


If p is not smaller than the minimum of m and n, we can transform the augmented matrix 45 
similarly to Theorem 8.7, as described in the following remark. 


Remark In the case when rank r(A) = p is not smaller than min(m, n}, we can transform 
the augmented matrix A, = (A | b) by elementary transformations and column interchanges 
to get one of the following special cases: 


(1) If m < n, then 


ж ж ж ж 

а ар aim ауы ai, | bi 

0 а ... aš аў e. 0, bž 

22 2m 2,m+1 2n 2 

ж ж э! 

(A* |b") = |. : : . i : (8.7) 

ж ж ж ж 

0 0 апт Ond айл | 05 


; * * * 
with a, © 45°... ап £ 0. 
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(2) Ifm > n, then 


aj aj, ... aj| bj 
0 a, ... а bj 
0 0 ... а* b* 
* жү |. пп п 
Е ME лый, (8.8) 
0 0 ... 0 0 
0 0 ... 0 0 
with ат аў»... a5, A 0 and bz, , = 0 orb, £0. 
(3) If m — n, then 
"n а; PN aj, 7 
aW cu 
(4* | b*) = s al (8.9) 
0 0 .. а | bX 
with aj, а5, -... ах, #0. 


Considering the largest order of a minor different from zero, we conclude that in case (1) 
above, matrix A has rank m, in case (2) matrix A has rank n and in case (3), matrix A has rank 
m — n. The discussion of the consistency of the system of linear equations and the selection 
of the free variables is the same as for the pivoting procedure. 


In all matrices (8.6) to (8.9), an upper triangular submatrix with non-zero entries on the 
diagonal is contained. We next describe the systematic generation of the triangular form 
given in Theorem 8.7 and the above remark. Assume that a11 4 0 (otherwise we interchange 
two rows or columns). Then we transform all equations except the first one, where the new 
kth equation, k € {2,3,...,m}, is obtained by multiplying the first equation by —aj1/a11 
and adding the resulting equation to the original kth equation. Element а is also denoted 
as the pivot or pivot element. This leads to the following system: 


auxi + anx + +--+ + ах =), 
аә + > + Фах =b 
400 + +++ + GmnXn = bg, 
where 
E ак ө 
dg = ау – = ау, К = 2,3,...,т, j = 2,3,...,п. 
11 


By the above transformation we have obtained а system where variable x; occurs only in the 
first equation, i.e. all elements below the pivot element are now equal to zero. Now we apply 
this procedure to equations 2,3,...,m provided that a2 з 0 (otherwise we interchange 
two rows or columns) and so on until a triangular form according to (8.6) to (8.9) has been 
obtained. 
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From the triangular system, we determine the values of the variables by ‘back substitution’ 
(i.e. we determine the value of one variable from the last equation, then the value of a second 
variable from the second to last equation and so on). It is worth noting that only the first 
equations belonging to the triangular submatrix are necessary to find the general solution, 
while the remaining equations are superfluous and can be skipped. We illustrate Gaussian 
elimination by the following examples. 


Example 8.5 Consider the following system of linear equations: 


Applying Gaussian elimination we obtain the following tableaus. 


Row ху x? x3 b Operation 


1 1 1 3 
2 1 -1 2 2 
3 6 —1 9 
4 1 1 3 rowl 


tA 
о ~ 
| 

кә 

_ 


—] row2-rowl 
6 0 2 —5 —3 row3-—4rowl 


ә 
— 
— 
- 


3 row4 
8 0 -2 1 -1 row5 
9 0 0 —4 -—4 row6+row5 


From rows 7 to 9, we see that both the coefficient matrix А and the augmented matrix A, have 
rank three. Therefore, the given system is consistent and has a unique solution. Moreover, 
we get the following triangular system: 


xy + x2 + аз —3 
— 2xj + x —— 
= 4x3 =—4 


Applying back substitution, we get from the last equation x3 = 1. Then we obtain from the 
second equation 


2x2 = —1 — x3 = —2 


which yields x2 = 1, and finally from the first equation 


xy =3-x2-x3=3-1-1 


which gives x} = 1. 
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Example 8.6 We solve the following system of linear equations: 
xj + x2 + x3 =2 
3x, + 2x2 + хз = 
2x + 3x2. + 4x =l. 


Applying Gaussian elimination, we obtain the following tableaus. 


Row xj хә x3 b Operation 


W N 
w 
W N 
— 
N 


4 1 1 1 2 rowl 
—] -2 —4 row2-3rowl 
1 2 —3 row3-2rowl 


N 
— 
— 
— 
N 


row 4 
—] -2 —4 row5 
0 0 —7 row6+row5 


oo 


From row 9 we see that the considered system has no solution since this equation 
Ox, + 0x? + 0x3 = —7 


leads to a contradiction. The rank of the coefficient matrix A is equal to two but the rank of 
the augmented matrix Ар is equal to three. 


8.1.4 General solution 


We now investigate how the set of solutions of a system of linear equations can be written 
in terms of vectors. The following theorem describes the general solution of a homogeneous 
system of linear equations. 


THEOREM 8.8 Let Ax = 0 bea homogeneous system of m linear equations with n variables 
and r(A) = p < n. Then there exist besides the zero vector n — p further linearly independent 
solutions x!,x?,...,x”-? € R” of the system Ax = 0, and the set of solutions Sz can be 


written as 


SH = [x € R” | х = Aix!  A2x2 +--+ An px”, AAZ.. An p € R] : 


According to Definition 7.15 the set Sg corresponds to the kernel of a linear mapping 
described by matrix A. In order to present the general solution, we need n — p linearly 
independent solution vectors. To illustrate how these vectors can be found, we consider the 
following example of a homogeneous system of linear equations and determine the general 
solution of this system. 
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Example 8.7 Let Ax = 0 with 


1 2 0 1 1 

=1 c]. 2. d 

a> —1 1 -1 2 
1 1 1 0 3 


Applying Gaussian elimination, we obtain the following tableaus: 


Row ху ху хз x4 x5 b Operation 
1 2 0 1 1 0 
2 -1 -3 1 -2 1] 0 
3 0 1 -1 2 0 
4 1 1 0 3 0 
5 1 2 0 1 1 0 rowl 
6 0 —1 1 —1 2 O0 rowl+row2 
7 0 —1 1 -1 2 0 row3 
8 0 —1 1 —1 2 0 -rowl-crow4 


We can stop our computations here because rows 7 and 8 are identical to row 6 and thus we 
can drop rows 7 and 8. (If we continue applying Gaussian elimination, we obtain two rows 
containing only zeroes.) Hence we use a triangular system with the following two equations 
obtained from rows 5 and 6: 


xy + 2x2 + x + x5 =0 
— X2 + x3 — X4 + 2x5 =0 (8.10) 


Thus, we have r(4) = 2. Because n = 5, there are three linearly independent solutions x! х2 


and x?. They can be obtained for instance by setting exactly one of three variables that can 
be chosen arbitrarily to be equal to one and the other two of them to be equal to zero, i.e. 


Now the remaining components of each of the three vectors are uniquely determined. It is 
clear that the resulting vectors are linearly independent since the matrix formed by the three 
vectors x!, x? and X? is a3 x 3 identity matrix. Determining the remaining components from 
system (8.10), we obtain 


о о н Fe 
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Thus, according to Theorem 8.8 we get the following set Sy of solutions: 


292 1 —5 
1 -1 2 

Sy= 4x? eR?) х-л 1 |+ 0 [+в] 9 |; А, ER 
0 1 0 
0 0 1 


The following theorem characterizes the general solution of a non-homogeneous system of 
linear equations. 


THEOREM 8.9 Let Ax = b be a system of m linear equations with лп variables and 
r(A) = p « n. Moreover, let 


Sy = [x € R” | xF = Ах! Xx? o Appx” P, A, Aas. An p € R] 


denote the general solution of the homogeneous system Ax = 0 and x" be a solution of the 
non-homogeneous system Ax = b. Then the set of solutions S of system Ax = b can be 
written as 


$- [xem ix ox +x", x! e sy] 


= {x eR’ |х = Ах! + Ах? qe. + Ag- px" Р +x, A1, À2,..., An-p € R}. 


Theorem 8.9 says that the general solution of a non-homogeneous system of linear equations 
is obtained as the sum of the general solution of the corresponding homogeneous system (with 
vector b replaced by the zero vector 0) and a particular solution of the non-homogeneous 
system. 


As we see in this and the next chapters, the pivoting procedure is also the base for solving 
linear systems of inequalities and linear programming problems. The Gaussian elimination 
procedure is sometimes advantageous when the system contains parameters. Finally, we give 
a more complicated example of a system of linear equations containing a parameter a, and 
we discuss the solution in dependence on the value of this parameter. 


Example 8.8 We determine all solutions of the following system of linear equations: 


ах + x? + x3 ë =l 
X1 + ах + X3 —a 
х + X2 + ахз =a, 
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where a € R. We interchange the first and third equations. Applying now Gaussian elimi- 
nation, this gives the first pivot 1 (in the case of taking the original first equation, we would 
have pivot a, and this means that we have to exclude the case a = 0 in order to guarantee 
that the pivot is different from zero). We obtain the following tableaus. 


Row x, x2 X3 b Operation 
1 1 a 1 a 
2 1 1 a a 
3 a 1 1 1 
4 1 a 1 a row 1 
5 0 l-a a—1 a?—a row 2— row 1 
6 0 —a? +1  —a+1 —a?+1 row3-—arowl 
7 1 a 1 a row 4 
8 0 1-а a—1 а-а том5 
9 0 -а2-а+2 0 1-а  row6+ row 5 


In row 5, we have taken a — 1 in the x3 column as pivot and this generates a zero below this 
element. (Notice that this implies a — 1 Æ 0.) In the third tableau (rows 7 to 9), we can still 
interchange the columns belonging to variables x? and x3 (to generate formally a triangular 
matrix). We first consider the case when the coefficient of variable хә in row 9 of the above 
scheme is equal to zero, і.е. —a? — a + 2 = 0. For this quadratic equation we get two real 
solutions ај = 1 and a? = —2. Ifa = a4 = 1, then the right-hand side 1 — a is also equal to 
zero. Moreover, in this case also all elements of row 8 are equal to zero, but row 7 contains 
non-zero elements. Hence both the coefficient matrix and the augmented coefficient matrix 
have rank one: r(A) = r(A | b) = 1, and there exist infinitely many solutions (two variables 
can be chosen arbitrarily). Choosing variables x? = s € R and x3 = t € R arbitrarily, we get 
xj—-1-—s-t. 

This solution can be alternatively written using Theorem 8.9. To find x”, we determine xl 
using xl = 5 = 1,x} = t = 0 which yields xt = —] (notice that x1 = —s — t in the 
homogeneous system) and x? using x2 =s= 0, x3 = f = 1 which yields х2 = —]. To geta 
particular solution x of ће non-homogeneous system, we set xY = x = 0 which yields 
= 1. Therefore, we get the general solution S as follows: 


$- |хев ix 2x ex"] 


—1 —1 1 
=fxeR |x=r,{ 1 |+| 0 |+ 0]; лєк 
0 1 0 
If a = a» = —2, then the right-hand side 1 — a = 3 is different from zero. In this case, we 


have r(A) = 2 but r(A45) = 3. Therefore, the system of linear equations is inconsistent, and 
there is not a solution of the system. 

Consider now the remaining cases with а Æ 1 and a zz —2. In all these cases we have 
r(A) = r(Ap) = 3, and thus there exists a uniquely determined solution. Using rows 7 to 9 
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in the above tableau, we get 


l-a l-a 1 
X2 = 2 = = 
—a*—a+2 (a—l1)(a+2) а+2 
a —a—(1—a» a(a—1)—(1—a)x2 
X3 = = =a+x2 
a—1 a—1 
1 a+2a+1 (a+1)? 
=at = = 
a+2 a+2 a+2 
(a+ 1)? a 
х= а — хз — ах =a 
а+ 2 а+ 2 
G4 +2а- а - 2а-1-а © а+1 
B a+2 ^ at? 


We have already discussed a method for determining the inverse of a regular matrix A (see 
Chapter 7.5). The method presented there requires us to evaluate a lot of determinants so 
that this is only efficient for matrices of order n x n with a small value of n. Of course, we 
can also determine matrix A~! via the solution of systems of linear equations with different 
right-hand side vectors, as in the following section. 


8.1.5 Matrix inversion 


The inverse X = 47! of a matrix A of order n x n satisfies the matrix equation 
AX — I, 


assuming that |A| # 0 (notice that this corresponds to the condition r(4) = n). Let the 
column vectors of matrix X be: 


X11 X12 Xin 

1 X21 2 X22 P X2n 
X = . > x = . , , X == 

Хп] Xn2 Xnn 


Then the matrix equation AX = J can be written as 
A(x! х2... х") = = (ele? ... e”) 

which is equivalent to the following n systems of linear equations: 
Ax! = el, Ах? =e, ..., Ax" = е". 


These n systems of n linear equations differ only in the right-hand side vector and all the 
transformations of the coefficient matrix A within the application of a solution method are 
the same. Thus, applying for instance the pivoting procedure, we can use the following 
scheme for solving simultaneously these systems of n linear equations, i.e. if r(4) = n, we 
obtain after n pivoting steps the inverse 4^! = X = (xy). In the following tableaus we have 
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assumed that no rows and columns have been interchanged (so the variables occur according 
to their natural numbering), i.e. the elements on the diagonal have successively been chosen 
as pivots. Of course, we can analogously also apply the Gaussian elimination procedure to 
solve the above systems with different right-hand side vectors. 


Row bv A I 
1 = ay а ain 1 0 
2 — a2, 422 an 1 0 
n = а am ann 0 0 1 
п(п= 1) +1 x, 1 0 ted 0 Хүр X12 «+. Xin 
пп-1) +2 x 0 1 ИР 0 хәр X22 ... Xn 
n? Xn 0 0. uu l Xni Xm ... Xm 
I 4A 


Example 8.9 We consider matrix 


1 3 1 
А= |2 -1 1 
1 1 1 


and determine the inverse 4^! by means of the pivoting procedure. The computations are 
shown in the following scheme. 


Row bv A I Operation 

1 — 1 3 1 1 0 0 
— 2 —1 1 0 1 

3 = 1 1 0 0 1 
4 x1 1 3 1 1 0 0 row 1 
5 = 0 7 1 2 1 0 row 2 — 2 row 1 
6 — 0 —2 0 -1 0 1 row 3 — row 1 
7 xj 1 0 1 i 3 row 4 + 3 row 5 
8 x2 0 1 1 2 –1 — 1 row 5 
9 = 0 ; -3 -3 1 row 6 — 2 row 5 
10 xj 1 0 0 1 -2 row 7 —2row 9 
11 x2 1 0 1 -4 row 8 — 5 row 9 
12 x3 0 1 -3 - 2 7 row 9 


Since it is possible to generate an identity matrix on the left-hand side (see rows 10 to 12), 
the rank of matrix A is equal to three and the inverse A~! exists. From rows 10 to 12, 
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we obtain 
1 1 -2 
1 1 
41= |2 9-5 
3 7 
= ою E 


Computing the product 474 gives the identity matrix and confirms the correctness of our 
computations. 


8.2 SYSTEMS OF LINEAR INEQUALITIES 


8.2. Preliminaries 


We start this section with the following introductory example. 


Example 8.10 А drink consisting of orange juice and champagne is to be mixed for a party. 
The ratio of orange juice to champagne has to be at least 1 : 2. The total quantity (volume) 
of the drink must not be more than 30 1, and at least 4 1 more orange juice than champagne 
are to be used. 


We denote by x, the quantity of orange juice in litres and by x? the quantity of champagne 
in litres. Then we get the following constraints: 


ХІ 3 x2 > 1:2 
Xp X2 < 4 
xj + x) < 30 

XpX2 = 0. 


Hereafter, the notation x, x2? > 0 means that both variables are non-negative: x; > 0,x2 > 0. 
The first inequality considers the requirement that the ratio of orange juice and champagne 
(i.e. x1 : x2) should be at least 1 : 2. The second constraint takes into account that at most 
41 orange juice more than champagne are to be used (i.e. x1 < х2 +4), and the third constraint 
ensures that the quantity of the drink is no more than 30 1. Of course, both quantities of orange 
juice and champagne have to be non-negative. 


The first inequality can be rewritten by multiplying both sides by —2x2 (note that the inequality 
sign changes) and putting both variables on the left-hand side so that we obtain the following 
inequalities: 


—2x| + l2 x 0 
х — x < 4 

х + x) < 30 
XpX2 = 0. 


In this section, we deal with the solution of such systems of linear inequalities with non- 
negative variables. 
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In general, we define a system of linear inequalities as follows. 


Definition 8.9 The system 


aux. + арх + > + амым R b 
алх + anx + ©- + ах R b 

i RS (8.11) 
Am1X1 + amx2 + +++ + mnXn Rm bn 


is called a system of linear inequalities with the coefficients aj, the right-hand sides b; 
and the variables x;. Here R; € {<,=,>}, i = 1,2,..., т, means that one of the three 
relations <, =, or > should hold, and we assume that at least one inequality occurs in 
the given system. 

The inequalities 


xj = 0, Ed! « (l2. sso) (8.12) 
are called non-negativity constraints. 


The constraints (8.11) and (8.12) are called a system of linear inequalities with |J | 
non-negativity constraints. 


In the following, we consider a system of linear inequalities with п non-negativity constraints 
(i.e. J = (1,2,...,n]) which can be formulated in matrix form as follows: 


AxRb, x>0, (8.13) 


where R = (R1, R2, ..., Rm)! denotes the vector of the relation symbols with R; € {<,=, >}, 
ba 1, 2) um. 


Definition 8.10 A vector x = (x1,x2,...,X,)' € R” which satisfies the system 
Ax R bis called a solution. If a solution x also satisfies the non-negativity constraints 
x > 0, it is called a feasible solution. The set 


M = {xeR"|4xRb,x>0} 


is called the set of feasible solutions or the feasible region of a system (8.13). 


82.2 Properties of feasible solutions 


First, we introduce the notion of a convex set. 


Definition 8.11 А set M is called convex if for any two vectors x!,x? є M, any 


convex combination Ax! + (1 А)х2 with 0 < А < 1 also belongs to set M. 
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The definition of a convex set is illustrated in Figure 8.1. In Figure 8.1(a), set M is convex 
since every point of the connecting straight line between the terminal points of vectors x! 
and x? also belongs to set M (for arbitrary vectors x! and x? ending in M). However, set M* 
in Figure 8.1(b) is not convex, since for the chosen vectors x! and x? not every point on the 
connecting straight line between the terminal points of both vectors belongs to set М“. 


X24 X24 


(a) Convex set M (b) Non-convex set M* 


Figure 8.1 A convex set and a non-convex set. 


Definition 8.12 А vector (point) x є M is called the extreme point (or corner point 
or vertex) of the convex set M if x cannot be written as a proper convex combination 
of two other vectors of M, i.e. x cannot be written as 


x = Ах! + (1—A)x? with x!, x? € M and 0 <A < 1. 


Returning to Figure 8.1, set M in part (a) has six extreme points xU, x®,...,x® ог 
equivalently the terminal points P1, P2,..., Pg of the corresponding vectors (here and in 
the following chapter, we always give the corresponding points P; in the figures). 


In the case of two variables, we can give the following geometric interpretation of a system 
of linear inequalities. Assume that the constraints are given as inequalities. The constraints 
апхі + арх Ribi, Ri € {<, >}, і = 1,2,...,m, аге half-planes which are bounded by the 
lines ajjx1 + архә = bj. The ith constraint can also be written in the form 
ХІ x2 
+ ГЕ”, 


Ds 22 =1, 


Sil 512 


where s; = Р; /ад and sj? = bj/aj» are the intercepts of the line with the х] axis and the x2 
axis, respectively (see Figure 8.2). 


The non-negativity constraints x > 0 and x? > 0 represent the non-negative quadrant in 
the two-dimensional space. Thus, when considering a system of m inequalities with two 
non-negative variables, the feasible region is described by the intersection of m half-planes 
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ал21 Qiaro = bj 


Figure 8.2 Representation of a line by x, and x2 intercepts. 


Tı 

Figure 8.3 The set of solutions as the intersection of half-planes. 

with the non-negative quadrant. This is illustrated for the case m = 3 in Figure 8.3, where 
the feasible region M is dashed. In Figure 8.3 and figures which follow we use arrows to 
indicate which of the resulting half-planes of each constraint satisfies the corresponding 


inequality constraint. The arrows at the coordinate axes indicate that both variables have to 
be non-negative. 


In general, we can formulate the following property. 


THEOREM 8.10 The feasible region M of system (8.13) is either empty or a convex set 
with at most a finite number of extreme points. 


PROOF We only prove that, if M 5 Ø, it is a convex set. Let x!, x? є M, i.e. we have 


Ax! R b, x! >0 and Ax? Rb, x? > 0, 
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and we prove that Ax! + (1 — A)x? є M. Then 
арх! epe x | = Ax! + (1 — 04x? R Ab + (1 —2)b = b (8.14) 


and due to Ax! > 0 and (1 — А)х2 > 0 for0 < A < 1, we get 
Ах! + (1—A)x? > 0. (8.15) 


From (8.14) and (8.15), it follows that M is convex. a 


A convex set with a finite number of extreme points is also called a convex polyhedron. One 
of the possible cases of Theorem 8.10 is illustrated in Figure 8.4. Here the values of both 
variables can be arbitrarily large provided that the given two inequalities are satisfied, i.e. 
the feasible region is unbounded. In this case, the feasible region has three extreme points 
Pı, P2 and P3. The following theorem characterizes the feasible region of a system of linear 
inequalities provided that this set is bounded. 


Tı 


Figure 8.4 An unbounded set of solutions M. 


THEOREM 8.11 Let the feasible region M of system (8.13) be bounded. Then it can 
be written as the set of all convex combinations of the extreme points x!,x’,...,x° of 
set M, i.e. 


M =[x € R" Ix = Mx! +5? e A 


S 
белел eq 7p i}. 
i=1 


In the case of only two variables, we can graphically solve the problem. To illustrate the 
determination of set M, consider the following example. 
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Example 8.11 Two goods С and С» are produced by means of two raw materials Rı and 
Кә with the capacities of 50 and 80 units, respectively. To produce 1 unit of Сі, 1 unit of 
Ку and 1 unit of R? are required. To produce 1 unit of G2, 1 unit of R, and 2 units of R2 are 
required. The price of С is 3 EUR per unit, the price of G2 is 2 EUR per unit and at least 60 
EUR worth of goods need to be sold. 

Let x; be the number of produced units of G;, i € {1,2}. A feasible production programme 
has to satisfy the following constraints: 


xy + хэ x 50 (Г) (constraint for R1) 

xq + 2x9 < 80 (D (constraint for R2) 

Зх + 2x2 > 60 (Il) (selling constraint) 
x1,x%2 > 0 (non-negativity constraints) 


This is a system of linear inequalities with only two variables, which can be easily solved 
graphically. The feasible region is given in Figure 8.5. The convex set of feasible solutions 
has five extreme points described by the vectors x’ (or points Р;), i = 1,2,...,5: 


20 50 20 
30 2: 3 
uo ee aba) 
_{ 0 
Therefore, the feasible region M is the set of all convex combinations of the above five 


extreme points: 
20 50 20 0 
(p )+»(%)+з( rmn) 


5 
0 
es( w) À1,22,...,45 = 0, San]. 
i= 


2 
м-|хев 


(Ш) (1) (Ш) 


Figure 8.5 Feasible region for Example 8.11. 
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If the feasible region M is unbounded, there exist (unbounded) one-dimensional rays ema- 
nating from some extreme point of M on which points are feasible. Assume that there are u 
such rays and let r!,r?,...,1r“ denote those vectors pointing from a corresponding extreme 
point in the direction of such an unbounded one-dimensional ray. Then the feasible region 
M can be described as follows. 


THEOREM 8.12 Let the feasible region M of system (8.13) be unbounded. Then it can be 
written as follows: 


0<A; <1, 112; 5; у ped Ii Bas. =O}, 
i=1 
where x!,x,...,x° are the extreme points of set M and r!, r2, . . . , r" are the vectors of the 
unbounded one-dimensional rays of set M. 


According to Theorem 8.12, any feasible solution of an unbounded feasible region M can 
be written as a convex combination of the extreme points and a linear combination of the 
vectors of the unbounded rays with non-negative scalars j4;,j = 1,2,...,u. Considering the 
example given in Figure 8.4, there are two unbounded one-dimensional rays emanating from 
points Р» and P3. 


A relationship between extreme points and basic feasible solutions is given in the following 
theorem. 


THEOREM 8.13 Any extreme point of the feasible region M of system (8.13) corresponds 
to at least one basic feasible solution, and conversely, any basic feasible solution corresponds 
exactly to one extreme point. 


The latter theorem needs to be discussed in a bit more detail. We know from our previous 
considerations that in a basic solution, all non-basic variables are equal to zero. Thus, if the 
coefficient matrix A of the system of linear equations has rank m, at most m variables have 
positive values. We can distinguish the following two cases. 


Definition 8.13 Let M be the feasible region of system (8.13) and let r(4) — m. 
If a basic feasible solution x € M has m positive components, the solution is called 
non-degenerate. If the basic feasible solution x has less than m positive components, 
the solution is called degenerate. 


As we discuss later in connection with linear programming problems in Chapter 9, degeneracy 
of solutions may cause computational problems. In the case where all basic feasible solutions 
are non-degenerate solutions, Theorem 8.13 can be strengthened. 


Linear equations and inequalities 315 


THEOREM 8.14 Теѓ all basic feasible solutions of system (8.13) be non-degenerate solu- 
tions. Then there is a one-to-one correspondence between basic feasible solutions and extreme 
points of the set M of feasible solutions of system (8.13). 


8.2.3 A solution procedure 


According to Theorem 8.11 we have to generate all extreme points in order to describe the 
feasible region of a system of linear inequalities. Using Theorems 8.13 and 8.14, respectively, 
this can be done by generating all basic feasible solutions of the given system. 


In this section, we restrict ourselves to the case when system (8.13) is given in the special form 
Ax<b, x>0, with b>0, (8.16) 


where A is an m x n matrix and we assume that r(A) = m. (The case of arbitrary constraints 
is discussed in detail in the next chapter when dealing with linear programming problems.) 
This particular situation often occurs in economic applications. For instance, it is necessary to 
determine feasible production programmes ofn goods by means of m raw materials, where the 
coefficient matrix A describes the use ofthe particular raw materials per unit of each good and 
the non-negative vector b describes the capacity constraints on the use of the raw materials. 


Forasystem (8.16), inequalities are transformed into equations by introducing a slack variable 
in each constraint, i.e. for the ith constraint 


ахі + арх + +++ + GinXn < bi, 
we write 
айх + арх + +++ + GinXn + uj = bi, 


where и; > 0 is a so-called slack variable (i € {1,2,...,m}). In the following, we use the 
m-dimensional vector u = (u1, и2,..., Um)! for the slack variables introduced in the given 
m inequalities. 


Letting 
* ES X 
A" = (А, Г and x=( aJ? 
the system turns into 
; x 
A*x* — b with х= ( i ) > 0. (8.17) 
In this case, we can choose 
x 0 
x* = = 
u b 
as an initial basic feasible solution, i.e. the variables х], x2,...,x, are the non-basic variables 
and the slack variables и, u2,..., иһ are the basic variables. Moreover, we assume that X is 


a non-degenerate basic feasible solution (which means that b > 0). 
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Starting with this basic feasible solution, we systematically have to generate all other basic 
feasible solutions. To this end, we have to determine the pivot element in each step such 
that the new basic solution is again feasible. (Such pivoting steps are repeated until all basic 
feasible solutions have been visited.) 


For an m x n matrix A with r(A) = m, there are at most ("Т") different basic feasi- 
ble solutions of system (8.17) and therefore at most ("7") extreme points x of the set 
M = {x € R} | Ax x b}. Each pivoting step corresponds to a move from an extreme point 
(represented by some basic feasible solution) to another basic feasible solution (usually corre- 
sponding to another extreme point). We can use the tableau in Table 8.2 to perform a pivoting 
step. Rows 1 to m represent the initial basic feasible solution, and rows m 4- 1 to 2m represent 
the basic solution obtained after the pivoting step. (As before, we could also add an additional 


column to describe the operations that have to be performed in order to get rows m+ 1 to 2m.) 


Table 8.2 Tableau for a pivoting step 


Row bv x e Xp tt Xp ир... Ug e. Um b 

1 uy а] e ац ce ain L us 0 ... 0 bj 
k Uk а tee ак occ akn 0 ... 1 e. 0 bk 
m Um m] э 0g occ атп 0 ... 0 uus bm 
a a a a 

т+1 uw ац — 2а] e 0 ee ain — al kg RAD EC e. 0 bi — 226, 
akl ай 1 bk 
m+k x aud e] ee zu 0. шт epe 0 ТП 
а, ^а а a 

2m иһ 91 — аі e 0 65 ams — аа 0 ... m e. 10 bg— ar bk 


Assume that we choose aj; 4 0 as pivot element, i.e. we replace the basic variable ug by the 
original non-basic variable x;. Then we obtain the new basic solution 


E X 
u 
with 
b 
xt = E... Eh... Xn) = (o.....25.....0) 
ak] 
and 
a a 
ul = (11,..., Hg,... Hy) = СЕКСЕ] 
ац ац 


The new basic solution 
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is feasible if all components of vector x* are non-negative, i.e. 


b 
ko (8.18) 
akl 
and 
bi— L. hes vq i £k. (8.19) 


akl 


Next, we derive a condition such that both (8.18) and (8.19) are satisfied. First, since b > 0 
(remember that we are considering a non-degenerate solution) and aj; 4 0 by assumption, 
we have the equivalence 


br 
аш 


> 0 «= ац > 0. 


This means that in the chosen pivot column, only positive elements are a candidate for the 
pivot element. However, we also have to ensure that (8.19) is satisfied. We get the following 
equivalence: 

а bj b 

b-—L.b 0 «= > t>o foralli with ay > 0. 

ak] ail akl 
This means that we have to take that row as pivot row, which yields the smallest quotient 
of the current right-hand side component and the corresponding element in the chosen pivot 
column among all rows with a positive element in the pivot column. Summarizing, we can 
replace the basic variable и; by the non-basic variable x; if 

by 


. [bi 
ay > 0 and q = HE = min | 
ak] il 


аџ > 0, іє (2... mi. 


If one of the above conditions is violated, i.e. if aj; < 0 or quotient фк = к/а is not 
minimal among all quotients in the rows with a positive element in the pivot column, we do 
not get a basic feasible solution, i.e. at least one component ofthe new right-hand side vector 
would be negative. Therefore, we add a column Q in the tableau in Table 8.2 to calculate 
the corresponding quotients g; which have to be taken into account. Notice that in the new 
tableau, there is again an identity submatrix (with possibly interchanged columns) contained 
in the transformed matrix belonging to the basic variables. 


In the case of a degenerate basic feasible solution, all the above formulas for a pivoting step 
remain valid. In such a case, we have a smallest quotient д = 0 (which, by the way, means 
that also in the case of choosing a negative pivot element a basic feasible solution results). 
We will discuss the difficulties that may arise in case of degeneracy later in Chapter 9 ina 
bit more detail. 


The process of enumerating all basic feasible solutions of system (8.13) can be done by hand 
(without using a computer) only for very small problems. Consider the following example. 


Example 8.12 Consider the system of linear inequalities presented in Example 8.10. Intro- 
ducing slack variables u1, из and из, we obtain the initial tableau in rows 1 to 3 below. Now, 
the goal is to generate all basic feasible solutions of the given system of linear equations. 
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Since each basic feasible solution has three basic variables, there are at most (5) = 10 basic 
feasible solutions. We successively perform pivoting steps and might obtain for instance the 
following sequence of basic feasible solutions: 


Row bv xj x wj uw из b О 


1 ш -2 1 0 0 0 - 

uw 1-1 0 1 0 4 4 

3 uw 1 1 0 1 30 30 

4 wu 0-1 1 2 0 8 - 

5 x 1 -l 1 0 4 - 

6 us 2 0 -1 1 26 13 
3 1 

J ų 0 0 1 34 121 14 
1 1 

зл 1 0 0 ] 1 17 34 
1 1 

9 хх 0 1 0-1 2 m5 - 
2 1 

10 m 0 0 2? 1 oda 42 
1 1 

ll x 1 0 -l 1 10 30 
1 2 

12 ж 0 1 1 0 2 2% 30 


The corresponding pivot elements are printed in bold face. In the first tableau (rows 1 to 3), 
we can choose either the column belonging to x; or that belonging to x2 as pivot column 
(since there is at least one positive element in each of these columns). Selecting the column 
belonging to x1, the quotient is uniquely determined, and we have to replace the basic variable 
из by the non-basic variable х]. In the second pivoting step, we can choose only the column 
belonging to x2 as pivot column (since, when choosing the column belonging to u2, we come 
back to the basic feasible solution represented by rows 1 to 3). 


From the last basic feasible solution, we cannot generate another extreme point. However, we 
can perform another pivoting step so that из becomes a basic variable. Determining now the 
minimum quotient in the О column, it is not uniquely determined: we can choose x, or x2 as the 
variable that becomes a non-basic variable (since in both cases, we have the smallest quotient 
30). If we choose x; as the variable that becomes a non-basic variable, we get the tableau: 


Row bv ху ху wj и u b О 


13 uw -1 0 1 1 0 
14 u 3 0-1 0 1 30 
I5- x3» —2 d 1 0 0 


If we choose in the fourth tableau x? as the variable that becomes a non-basic variable, we 
get the following tableau: 


Row bv X1 x2 ul u? из b Q 
16 w 0 -+ 4 1 0 4 
17 x 1 -4 -i 1 
18 w 0 3 44 0 1 30 
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We still have to show that we have now indeed generated all basic feasible solutions. For 
the second, third and fourth tableaus we have discussed all the possibilities. We still have 
to check the remaining possibilities for the first, fifth and six tableaus. If in the first tableau 
we choose instead of the x; column the x2 column as pivot column, we get the fifth tableau 
(rows 13 to 15). Checking all possibilities for generating a new basic feasible solution in the 
fifth and sixth tableaus, we see that we can generate only such basic feasible solutions as 
we have already found. This means that the remaining possible combinations for selecting 
basic variables which are x2, u1, U3; x2,u1,u5; X1, U1, u2 and x1, x2, из do not lead to a basic 
feasible solution. (One can check this by trying to find all basic solutions.) 


Therefore, in the above example, there are six basic feasible solutions and four basic infea- 
sible solutions. From rows 1 to 18, we get the following basic feasible solutions. (The basic 
variables are printed in bold face.) 


(1 x1 =0, x20, uj; —0, u—4, w=30; 
(2) x =4, x2=0, uj; —8, u;—0, и; = 26; 


(3) x = 17, х = 13, uj —21, m=0, u= 0; 
(4) х = 10, х= 20, и = 0, ш = 14, з = 0; 
(5) x1 20, х= 0, ш= 0, u—4, w = 30; 


(6) х= 0, x20, ш= 0, ш = 4, w = 30. 


Deleting now the introduced slack variables и, их, из, we get the corresponding extreme 
points P, with the coordinates (0, 0), P2 with the coordinates (4, 0), Рз with the coordinates 
(17, 13) and P4 with the coordinates (10, 20). The fifth and sixth basic feasible solutions 
correspond to extreme point P, again. (In each of them, exactly one basic variable has value 
zero.) Therefore, the first, fifth and sixth basic feasible solutions are degenerate solutions 
corresponding to the same extreme point. 


Figure 8.6 Feasible region for Example 8.12. 


The feasible region together with the extreme points is given in Figure 8.6. It can be seen that 
our computations have started from point P1, then we have moved to the adjacent extreme 
point Р», then to the adjacent extreme point Рз and finally to Рд. Then all extreme points 
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have been visited and the procedure stops. According to Theorem 8.10, the feasible region 
is given by the set of all convex combinations of the four extreme points: 


Mal) | este Pale ess Pea) 


4 
Ai > 0, іє {1,2,3,4}, УА = |. 


i=1 


Since the columns belonging to the basic variables are always unit vectors, they can be 
omitted. All required information for performing a pivoting step is contained in the columns 
of the non-basic variables. Therefore, we can use a short form of the tableau, where the 
columns are assigned to the non-basic variables and the rows are assigned to the basic 
variables. Assume that altogether n variables denoted by х], x2,...,x, occur in the system of 
linear inequalities; among them there are m basic variables denoted by xpi, xg2,...,XBm and 
п! = п — т поп-Базѕіс variables denoted by xy, ху2,..., хур. In this case, the short form of 
the tableau is as in Table 8.3. 


Table 8.3 Short form of the tableau for a pivoting step 


Row bv хү ++ ХМ ee хүр 
ж ж ж ж 
1 ХВ1 ai Be к: ту ainw bi 
* eae * к, * * 
[4 XBk ay akı айу br 
m XBm абі а® a b 
хү XNI хүр 
m+1 x а? =a E ee E a* EU b* ирж 
Bl "up ak ay “Ж An ay km Іа в 
= аў а* bt 
m+ k XBk “kI aise de AD kan! 2k 
ап ап ап ар 
ж ж ж ж 
ж ат ж ат ж ат ж x _ lm px 
2т Bm dep аар a Ay бър Um — GEO 
In column bv, the basic variables xg1,xg2,...,XBm are given, and the next n’ columns rep- 
resent the non-basic variables xy1,xy2,..-,XNn’. After the first pivoting step, the Ath basic 


variable is now the former /th non-basic variable: xg, = ху. Correspondingly, the /th non- 
basic variable in the new solution is the former kth basic variable: xy; = хві. Notice that in 
each transformation step we have to write the sequence of the non-basic variables since the 
variables in the columns do not appear according to the numbering. 


Linear equations and inequalities 321 


It is worth emphasizing that, in contrast to the full form of the tableau, the new elements aj; 
of the pivot column are obtained as follows: 


= 1 
а = -7> 
ап 
= au. 
dj — ==, i=1,2,...,m, i £k, 
ап 


i.e. the pivot element is replaced by its reciprocal value and the remaining elements of the 
pivot column are divided by the negative pivot element. Notice that this column also occurs 
in the full form of the tableau, though not in the pivot column (where after the pivoting step 
a unit vector occurs) but it appears in the column of the new non-basic variable xy; in the 
full form of the tableau after the pivoting step. The following example uses the short form 
of this tableau. 


Example 8.13 We consider the following system of linear inequalities: 


x2 < 25 

2x, + x x 30 
—x + x) > —6 
X»X2 = 0. 


We multiply the third constraint by —1 and introduce in each inequality a slack variable 
denoted by u1, u2 and из. This yields a system with three equations and five variables. 


х + uu = 25 
2x, + x + и = 30 
X1 — x2 + us 6 


X1,X2,U],U2,U3 > 0. 


There are at most (3) = 10 basic feasible solutions, each of them including exactly three 
basic variables. In order to describe the feasible region, we have to generate systematically 
all the possible basic feasible solutions. Starting from the first basic solution with ху = 
0,x2 = 0,u4 = 25,u2 = 30, из = 6 and using the short form of the tableau, we obtain e.g. 
the following sequence of basic feasible solutions by subsequent pivoting steps. In the first 
step we have chosen the column belonging to x; as pivot column. (This is possible since at 
least one element in this column is positive; analogously one can also start with the column 
belonging to x2 as the first pivot column.) Using the quotients in the Q column, we have 
found that row 3 must be the pivot row and, therefore, we have the pivot element 1. From the 
tableau given by rows 4 to 6, we have to select the column belonging to variable x2 as pivot 
column (otherwise we come back to the basic feasible solution described by rows 1 to 3). 
The quotient rule determines row 5 as pivot row. Continuing in this way (note that the pivot 
column is now always uniquely determined since in the other case we would always go back 
to the previous basic feasible solution), we get the results in the following tableaus. 
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Row bv x1 x2 b Q 


1 ui 0 1 25 - 
2 uz 2 Т. 30. 15 
3 u3 1 -1 6 6 
u3 x2 
4 u 0 i. 25. 25 
5 и) -2 18 6 
6 x 1 -1 6 - 
u3 u2 
2 1 57 
7 и 3 -3 19 om 
x) -5 3 6 = 
9 x 1 I 12 36 
и] и? 
0 4 ч - $ 
11 x) 1 0 25 - 
2 xu o -j 1 5 5 
uy x] 
13 u; -À 31 
14 х2 1 0 25 
15 uz -1 2 5 


Having generated the above five basic feasible solutions, we can stop our computations for the 
following reasons. There remain five other selections of three variables out of the variables 
X1,X2, U1, U2, из Which, however, do not yield basic feasible solutions. The variables x2, u1, из 
as basic variables lead to a basic infeasible solution. (When selecting x» as the variable that 
becomes a basic variable in the next solution and row 2 as pivot row, this would violate the 
quotient rule since q2 = 30 > д = 25.) Similarly, the choice of x1, u1, из as basic variables 
does not lead to a basic feasible solution since we would violate the quotient rule when 
choosing the x; column as pivot column and row 2 as pivot row. The choice of x1, u2, us 
as basic variables is not possible because when choosing the x; column as pivot column 
and row 1 as pivot row in the first tableau, pivot zero would result and this is not allowed 
(i.e. the corresponding column vectors of matrix A belonging to these variables are linearly 
dependent). The choice of х], хә, из as basic variables leads to a basic infeasible solution 
because when choosing the u2 column as pivot column and row 7 as pivot row in the third 
tableau, the resulting pivot element would be negative. For the same reason, the choice of 
variables x2, u1, u2 as basic variables does not lead to a basic feasible solution. (Selecting the 
x2 column as pivot column and row 3 as pivot row in the first tableau yields the negative 
pivot —1.) Therefore, in addition to the five basic feasible solutions determined from rows 1 
to 15, there are four further basic infeasible solutions. 


From the above five basic feasible solutions, we obtain the resulting five extreme points of 
the feasible region M (by dropping the slack variables u1, из and из). Therefore, set M is 
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given by 


u ХІ 2 
а 


The graphical solution of the system of linear inequalities is given in Figure 8.7. The ith basic 
feasible solution corresponds to extreme point P;. In this example, there is exactly one basic 
feasible solution corresponding to each extreme point since each basic feasible solution is 
non-degenerate. 


Figure 8.7 Feasible region for Example 8.13. 


EXERCISES 


8.1 Decide whether the following systems of linear equations are consistent and find the 
solutions. Apply both methods of Gaussian elimination and pivoting (use the rank 


criterion). 
(a xy + 22 + 3x3 = 5 
2x + 3x2 + хз = 8 
Зх + x. + 2x3 = 5 
(D 3x) + x» + x = 3 
4х] — X2 + 2x4 = 4 
хл — xn + Xy md 
4х\ — xo + 2x3 = 5 
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8.2 


8.3 


8.4 


8.5 


(с) Зх 
3x1 
8х1 
6x1 

(d x 
8x1 
4x1 
5х1 

X1 


++ 


++++ + ++ 


4x2 
8x2 
5x2 
2x2 
2x2 
7x2 
5x2 
4х2 
4х2 


++ 


++++ + ++ 


х3 
бхз 
бхз 
5x3 
3x 3 
6x3 
9x3 
2x3 

x3 


+ 6x4 = 
+ 5x4 = 
+ 7x4 = 
+ 3x4 = 
= 16 
= 74 
49 
= 4 
L2 


л С\ М оо 


(a) Solve the following homogeneous systems of linear equations Ах = 0 with 


(i) A= 


N- о н 


W eRe 


л м We 


о ьо к 


Nr м к 
We We 


AND 
oR We 


(b) Find the solutions of the non-homogeneous system Ax = b with matrix A from 
(i) resp. (ii) and = (0,0, —2, 2)". 


Find the general solutions of the following systems and specify two different basic 
solutions for each system: 


(a 4x 


(D x 


++ 


x2 
x2 
x2 
2x: 2 
4x2 
2x2 
2x2 


3x3 


10x3 
5x3 


+ 5x4 = —2 
— x“ = 4 
= 2x4 = 1 
+ 4x5 = 2 
+ x4 + 3x5 = 1 
+ x4 — xs = ll 
+ 2x4 + 2x5 = 12 


What restriction on the parameter a ensures the consistence of the following system? 
Find the solution depending on a. 


3x + 4y + 


4z 
3x + 2y + 32 
4x + 5y + az 


2 
3 
4 


Check the consistence of the following system as a function of the parameter A: 


3x + 2y + 


2 
бх + 4y + А 


Do the following cases exist? 


(a) There is no solution. 
(b) There is a unique solution. 


Find the solution if possible. 


0 
0 
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8.6 Given is the system 


1 0 0 —1 x] 2 
01-1 1 | |. {з 
00 a 0 x | [1 
0 1 0 b x4 0 


(a) Decide with respect to the parameters a and b in which cases the system is 
consistent or inconsistent. When does a unique solution exist? 

(b) Find the general solution if possible. 

(c) Calculate the solution for a — 1 and b — 0. 


8.7 Decide whether the following system of linear equations is consistent and find the 
solution in dependence on parameters a and b: 


ax + (a+b)y + bz = 3a+5b 
bx + aby + az = a(2b+3)+5 
ax + by + bz = a+5b 


8.8 Find the kernel of the linear mapping described by matrix 


2 —2. 2 
A= |5 =k T 
3 —1 4 


8.9 Given are the two systems of linear equations 


x + 2x2 + 3x4 = 5 X1 + 2x2 + 3x = 14 
Зу + Ix + 2x = 5 and 2x, + 3x2 + хз = 13 
2x1 + 3x2. + 1хз = 8 Зх + x2 + 2x3 = 21. 


Solve them in only one tableau. 


8.10 Find the inverse matrices by pivoting: 


1 3 2 146 
(а) A={ 2 5 3 |; ®© B=]{3 2 1 |; 
=3 =g =й 7 8 8 


1 0 —1 2 
2.-1-2 3 
—1 2 2 -4 
0 1 2 —5 
8.11 Find the solutions of the following matrix equations: 


(a) XA — B with 


-1 3 2 
Aah 25-3 and B= (; E Е); 
-3 -8 -4 
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(b) AXB = C with 


25.11, 3 2 0 LY. 
ics » в- (3 J and dup 3 


(c) XA = —2(X + B) with 


—3 1 1 
A=|{-1 0 1 and в- (1 2 zh 
2 1 -2 
8.12 Assume we have a four-industry economy. Each industry is to produce an output just 
sufficient to meet its own input requirements and those of the other three industries 


as well as the final demand of the open sector. That is, the output level x; must satisfy 
the following equation: 


xj — хп Б хо taa Боха ЕУ» iE {1,2,3,4}, 


where y; is the final demand for industry i and x; is the amount of x; needed as input 
for the industry j € {1,2,3,4}. Let 


5 15 15 10 5 
25 25 10 30 10 
X= @)= | i0 20 20 20 and у= |30 
10 30 15 25 20 


(a) Find the input-coefficient matrix A which satisfies the equation 
x=Ax+y 


with x = (x1,x2,x3,x4)! and xj = ajjxj. 
(b How does the matrix X change if the final demand changes to y = 
(10, 20, 20, 15)T and the input coefficients are constant? 


8.13 Given is the bounded convex set with the extreme points 


2 2 -1 
х= |1 |, х2 = , х= |0 and х^ = 3 
0 1 1 
Do the points 
1 2 1 
a= 5 3 and b-[0 
1 2 


belong to the convex set given above? 


8.14 In addition to its main production a firm produces two products A and B with two 
machines I and II. The products are manufactured on machine I and packaged on 
machine II. Machine I has a free capacity of 40 hours and machine II can be used for 
20 hours. To produce 1 tonne of A takes 4 min on machine I and 6 min on machine II. 
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Producing 1 tonne of B takes 10 min on machine I and 3 min on machine II. What 
output combinations are possible? 


(a) Model the problem by means of a system of linear inequalities. 
(b) Solve the problem graphically. 
(c) Find the general solution by calculation. 


Solve the following problem graphically: 


—X1 uu x. < 4 
—2x, + о < 3 
XT — 2x2 < 1 
xix? > 0 


Find the general solutions of the following systems by calculation: 


(a) 2x1 + 5x? — 2x3 < 5 
4x, + x2 + X3 < 3 
X1,%2,%3 2 0 

(D xi + x2 + хз < 3 
xi + 5x3 x 3 
X1,X2,X3 = 0 


9 Linear programming 


In Chapter 8, we considered systems of linear inequalities and we discussed how to find the 
feasible region of such systems. In this chapter, any feasible solution of such a system is 
evaluated by the value of a linear objective function, and we are looking for a ‘best’ solution 
among the feasible ones. After introducing some basic notions, we discuss the simplex 
algorithm as a general method of solving such problems. Then we introduce a so-called 
dual problem, which is closely related to the problem originally given, and we present a 
modification of the simplex method based on the solution of this dual problem. 


9.1 PRELIMINARIES 


We start this section with an introductory example. 


Example 9.1 A company produces a mixture consisting of three raw materials denoted as 
Rı, R2 and Ёз. Raw materials Rı and R2 must be contained in the mixture with a given mini- 
mum percentage, and raw material R3 must not exceed a certain given maximum percentage. 
Moreover, the price of each raw material per kilogram is known. The data are summarized 
in Table 9.1. 

We wish to determine a feasible mixture with the lowest cost. Let x;,i € {1,2,3}, be the 
percentage of raw material R;. Then we get the following constraints. First, 


xi + хә + x3 = 100. (9.1) 


Equation (9.1) states that the sum of the percentages of all raw materials equals 100 per cent. 
Since the percentage of raw material Ёз should not exceed 30 per cent, we obtain the constraint 


хз < 30. (9.2) 


Table 9.1 Data for Example 9.1 


Raw material Required (%) Price in EUR per kilogram 


К at least 10 25 
Ro at least 50 17 
R3 at most 30 12 
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The percentage of raw material R? must be at least 50 per cent, or to put it another way, the 
sum of the percentages of Rı and Ёз must be no more than 50 per cent: 


xı d- x3 < 50. (9.3) 


Moreover, the percentage of А must be at least 10 per cent, or equivalently, the sum of the 
percentages of А2 and Ёз must not exceed 90 per cent, i.e. 


x2 + xs < 90. (9.4) 
Moreover, all variables should be non-negative: 
120, x220, 2х3 2 0. (9.5) 


The cost of producing the resulting mixture should be minimized, i.e. the objective function 
is as follows: 


z = 25x, + 17x2 + 12x3 —> min! (9.6) 


The notation z —> min! indicates that the value of function z should become minimal for 
the desired solution. So we have formulated a problem consisting of an objective function 
(9.6), four constraints (three inequalities (9.2), (9.3) and (9.4) and one equation (9.1)) and 
the non-negativity constraints (9.5) for all three variables. 


In general, a linear programming problem (abbreviated LPP) consists of constraints (a system 
of linear equations or linear inequalities), non-negativity constraints and a linear objective 
function. The general form of such an LPP can be given as follows. 


General form of an LPP 
Z = сух + C2X2 +... + CnXn —> max! (min!) 
subject to (s.t.) 


ajxi + d12X2 +...+ пхп Ry dy 
ах + d22X2 +...+@ ы К› by 


Am1X1 + 2х2 +... + AmnXn Rm bm 
x > 0, jeJ C {1,2,...,n}, 
where R; € {<, =, >}, 1<i<m. 
An LPP considers either the maximization or the minimization ofa linear function z = e! x. In 


each constraint, we have exactly one ofthe signs <, = or >, i.e. we may have both equations 
and inequalities as constraints, where we assume that at least one inequality occurs. 
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Alternatively, we can give the following matrix representation of an LPP: 
z=c!x — max! (тіп!). 
st. AxRb (9.7) 


x > 0, 


where R = (R1, R2,..., Rm)", Ri € {<,=,>}, i = 1,2,...,m. Here matrix A is of order 
m x n. The vector c = (cj,c2,...,Cn)! is known as the vector of the coefficients in the 
objective function, and the vector b = (b1, b2, . . ., bm)” is the right-hand-side vector. 


The feasibility of a solution of an LPP is defined in the same way as for a system of linear 
inequalities (see Definition 8.10). 


Definition 9.1 А feasible solution x = (x1,x2, . . . ,Xn)!, for which the objective func- 
tion has an optimum (i.e. maximum or minimum) value is called an optimal solution, 
and zo = cTx is known as the optimal objective function value. 


9.2 GRAPHICAL SOLUTION 


Next, we give a geometric interpretation of an LPP with only two variables x; and x2, which 
allows a graphical solution of the problem. 


For fixed value z and c2 Æ 0, the objective function z = суху + cox» is a straight line of the 
form 
C1 2 


хәз = + — 
с? с? 


i.e. for different values of z we get parallel lines all with slope — сі /c2. The vector 


a. 


points in the direction in which the objective function increases most. Thus, when maximizing 
the linear objective function z, we have to shift the line 
C1 2 
x: = —— + — 
c2 c2 
in the direction given by vector с, while when minimizing z, we have to shift this line in the 
opposite direction, given by vector —c. This is illustrated in Figure 9.1. 


Based on the above considerations, an LPP of the form (9.7) with two variables can be 
graphically solved as follows: 


(1) Determine the feasible region as the intersection of all feasible half-planes with the first 
quadrant. (This has already been discussed when describing the feasible region of a 
system of linear inequalities in Chapter 8.2.2.) 
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(2) Draw the objective function z = Z, where Z is constant, and shift it either in the direction 


given by vector c (in the case of z — max!) or in the direction given by vector —c (in the 
case of z — min!). Apply this procedure as long as the line z — constant has common 
points with the feasible region. 

T2| 
Z = CX] + Co9X9 


z — max! 
z — min! 
» 


21 


с 


Figure 9.1 The cases 2 — max! and z — min! 


Example 9.2 A firm can manufacture two goods С and С» in addition to its current 
production programme, where Table 9.2 gives the available machine capacities, the 
processing times for one unit of good С; and the profit per unit of each good С;, i € {1,2}. 


Table 9.2 Data for Example 9.2 


Process Processing times per unit Free machine capacity 
G G in minutes 

Turning 0 8 640 

Milling 6 6 720 

Planing 6 3 600 

Profit in EUR per unit 1 2 — 


(1) First, we formulate the corresponding LPP. Let x; denote the number of produced units 


(2) 


of good G;,i € {1,2}. Then we obtain the following problem: 


z = x1 + 2x2 — max! 


s.t. 8x9 < 640 

6x1 + 6x. x 720 

6x1 + 3x2 < 600 
xxn > 0. 


We solve the above problem graphically (see Figure 9.2). We graph each of the three 
constraints as an equation and mark the corresponding half-spaces satisfying the given 
inequality constraint. The arrows on the constraints indicate the feasible half-space. 
The feasible region M is obtained as the intersection of these three feasible half-spaces 
with the first quadrant. (We get the dashed area in Figure 9.2.) We graph the objective 
function z = Z, where Z is constant. (In Figure 9.2 the parallel lines z = 0 and z = 200 
are given.) The optimal extreme point is P4 corresponding to хт = 40 and x; = 80. 


332 Linear programming 


max 


The optimal objective function value is 20) = хү + 2x; = 40 + 2 - 80 = 200. Notice 
that for z > 200, the resulting straight line would not have common points with the 
feasible region M. 


11 + 25 = 120 
2x, + x3 = 200 


Figure 9.2 Graphical solution of Example 9.2. 


Next, we determine the basic feasible solutions corresponding to the extreme points 
P|, P2,...,Ps. To this end, we introduce a slack variable in each constraint which yields 
the following system of constraints: 


8x2 + x3 = 640 
6x, + 6x2 + x4 = 720 
6x, + 3x2 +X5 600 


Xj, X2, X3, X4, X5 = 0. 


Considering point P, withx; = x? = 0, the basic variables are x3, x4 and xs (and consequently 
the matrix of the basis vectors is formed by the corresponding column vectors, which is an 
identity matrix). The values of the basic variables are therefore 


хз = 640, x4=720 and xs = 600, 


and the objective function value is zı = 0. Considering point Р, we insert x; = 100, x2 = 0 
into the system of constraints, which yields the basic feasible solution 


X] = 100, X2 = 0, X3 = 640, X4 = 120, X5 = 0 


with the objective function value z2 = 100. Considering point P3 with х = 80, хә = 40, 
we obtain from the system of constraints the basic feasible solution 


xı = 80, x2=40, x3= 320, x4=0, x5=0 
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with the objective function value z3 = 160. Considering point P4 with ху = 40, x2 = 80, 
we get from the system of constraints the basic feasible solution 


xı = 40, x2—80, x3—0, x4=0, x5 = 120 


with the objective function value 24 = 200. Finally, considering point Ps with x; = 0, 
x2 = 80, we get the corresponding basic feasible solution 


х= 0, х= 80, хз = 0, x4 = 240, x5 = 360 
with the objective function value 25 = 160. 


The above considerations have confirmed our graphical solution that P4 is the extreme point 
with the maximal objective function value 20) = 24 = 200. 


Example 9.3 We solve graphically the LPP 


2 = х1 + хә > max! 


s.t. хл + x» > 6 
—ах + x) > 4 
x1,%2 = 0, 


where a denotes some real parameter, i.e. a € IR. We first draw the constraints as equations and 
check which of the resulting half-spaces satisfy the given inequality constraint. The arrows 
on the constraints indicate again the feasible half-spaces. This way, we get the feasible region 
M as the intersection of the half-spaces (see Figure 9.3). Since the second constraint contains 
parameter a, we graph in Figure 9.3 the line resulting from the second constraint as equation 
for some values of a, namely: a — —2, a — 0 and a — 2. (M is dashed for a — 2, while for 
a — —2 and a — 0 only the resulting constraint including parameter a is dashed.) 


Figure 9.3 Feasible region M for Example 9.3. 
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It can be seen that all the resulting equations for different values of parameter a go 
through the point (0,4), and the slope of the line is given by 1/a for a zz 0. To find 
the optimal solution, we now graph the objective function z = Z, where Z is con- 
stant. (In Figure 9.3 the lines z — 1 and z — 8 are shown.) The arrow on the lines 
z — constant indicates in which direction the objective function value increases. (Remem- 
ber that the dashed area gives the feasible region M of the problem for a — 2.) From 
Figure 9.3, we see that the function value can become arbitrarily large (independently of 
the value of parameter a), i.e. an optimal solution of the maximization problem does not 
exist. 


We continue with some properties of an LPP. 


9.5 PROPERTIES OF A LINEAR PROGRAMMING PROBLEM; 
STANDARD FORM 


Let M be the feasible region and consider the maximization of the objective function 
z = сх. We know already from Chapter 8.2.2 that the feasible region M of a system of 
linear inequalities is either empty or a convex polyhedron (see Theorem 8.10). Since the 
feasibility of a solution of an LPP is independent ofthe objective function, the latter property 
also holds for an LPP. We can further reinforce this and the following three cases for an LPP 
may occur: 


(1) The feasible region is empty: M = Й. In this case the constraints are inconsistent, 
i.e. there is no feasible solution of the LPP. 

(2) M is a non-empty bounded subset of the n-space IR". 

(3) M is an unbounded subset of the n-space IR", i.e. at least one variable may become 
arbitrarily large, or if some variables are not necessarily non-negative, at least one of 
them may become arbitrarily small. 


In case (2), the feasible region M is also called a convex polytope, and there is always a 
solution of the maximization problem. In case (3), there are again two possibilities: 


(3a) The objective function z is bounded from above. Then an optimal solution of the 
maximization problem under consideration exists. 

(3b) The objective functionz is not bounded from above. Then there does not exist an optimal 
solution for the maximization problem under consideration, i.e. there does not exist a 
finite optimal objective function value. 


Cases (3a) and (3b) are illustrated in Figure 9.4. In case (3a), there are three extreme points, 
and P is an optimal extreme point. In case (3b), there exist four extreme points, however, the 
values of the objective function can become arbitrarily large. 


Linear programming 335 


z — max! 


z 


— constant 


2 = constant 


case (3a) case (3b) 


Figure 9.4 The cases (3a) and (3b). 


THEOREM 9.1 If an LPP has an optimal solution, then there exists at least one extreme 
point, where the objective function has an optimum value. 


According to Theorem 9.1, one can restrict the search for an optimal solution to the consid- 
eration of extreme points (represented by basic feasible solutions). The following theorem 


characterizes the set of optimal solutions. 


THEOREM 9.2 Let P1, P»,..., P, described by vectors xl,x2,..., x be optimal extreme 
points. Then any convex combination 


x = Axl bx +.. H Ax, м >0,їй=1,2,...,”, b» s (9.8) 


is also an optimal solution. 


PROOF Letx!,x2,..., х” be optimal extreme points with 
p р 


ex = cx =... = TY = Baer 


and x^ be defined as in (9.8). Then x is feasible since the feasible region M is convex. 
Moreover, 


etx? = eT (ax! + Арх? +... H Arx”) = А (elxl) + As(elx2) +... + А, (еТх”) 
= №1208 + А220 +... + А201 = (А d А +... + А,)208 = 2018, 


i.e. point x? is optimal. [| 


In Figure 9.5, Theorem 9.2 is illustrated for a problem with two variables having two 
optimal extreme points P, and P2. Any point on the connecting line between the points 
P, and P» is optimal. 
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X25 


2 = constant 


Figure 9.5 The case of several optimal solutions. 
Standard form 


In the next definition, we introduce a special form of an LPP. 


Definition 9.22 An LPP of the form 


z = clx — max! 


s.t. Ax=b, х> 0, 


where А = (Ay,1) and b > 0 is called the standard form of an LPP. 


According to Definition 9.2, matrix A can be partitioned into some matrix Ay and an identity 
submatrix /. Thus, the standard form of an LPP is characterized by the following properties: 


(1) the LPP is a maximization problem; 

(2) the constraints are given as a system of linear equations in canonical form with non- 
negative right-hand sides and 

(3) all variables have to be non-negative. 


It is worth noting that the standard form always includes a basic feasible solution for the 
corresponding system of linear inequalities (when the objective function is skipped). If no 
artificial variables are necessary when generating the standard form or when all artificial 
variables have value zero (this means that the right-hand sides of all constraints that contain 
an artificial variable are equal to zero), this solution is also feasible for the original problem 
and it corresponds to an extreme point of the feasible region M. However, it is an infeasible 
solution for the original problem if at least one artificial variable has a value greater than 
zero. In this case, the constraint from the original problem is violated and the basic solution 
does not correspond to an extreme point of set M. The generation of the standard form of an 
LPP plays an important role in finding a starting solution in the procedure that we present 
later for solving an LPP. 


Any LPP can formally be transformed into the standard form by the following rules. 
We consider the possible violations of the standard form according to Definition 9.2. 


(1) 


Q) 


G) 


(4) 
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Some variable x; is not necessarily non-negative, i.e. x; may take arbitrary values. 
Then variable x; is replaced by the difference of two non-negative variables, i.e. we set: 


ж жж : * жж 
Xj =X; — х) with x zm 0 and ху > 0. 
Then we get: 


> х => лу > 0 


= xj = у= 0 


de dE 


«xj e x < 0. 
The given objective function has to be minimized: 
Z = сіх + cox? t... CnXn > min! 


The determination of a minimum of function z is equivalent to the determination of a 
maximum of function z = —z: 


Z = суху + C2X2 +... + CX, — min! 


SS Z = —g = —C1X1 — C2X2 — ... — CnXn — max! 
For some right-hand sides, we have b; < 0: 
апх + арх +... + dinXn = bj < 0. 


In this case, we multiply the above constraint by —1 and obtain: 


арх — арх — ... — архһ = —bj > 0. 
Let some constraints be inequalities: 
апхі + арх +... + ainXn < bi 
or 
ajiX| + ak2X2 +... + пхп > by. 


Then by introducing a slack variable и; and a surplus variable ик, respectively, we obtain 
an equation: 


апхі + арх +... + AinXn + ui = bj with uj > 0 
Or 


архі + арх +... + ügaXg — uy = bg with uy > 0. 
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(5) Let the given system of linear equations be not in canonical form, i.e. the constraints are 
given e.g. as follows: 


ахі + a12X2 +... + пхп = bı 
а21х1 + а22х2 +... + аһхһ = b2 


ат1Х\ + dg2X2 +... + AmnXn = bm 


with b; 20, i= 1,2,...,m; x; > 0, j =1,2,...,n. 


In the above situation, there is no constraint that contains an eliminated variable with 
coefficient +1 (provided that all column vectors of matrix A belonging to variables 
X1,X2,...,X, are different from the unit vector). Then we introduce in each equation an 
artificial variable x4; as basic variable and obtain: 


a11X1 + d12X2 +... + ахь + X41 =b; 
а21х1 + 42202 +... + аәпхХп - X42 =b 
Am1X1 + ат2х2 + ... + AmnXn + Хат = bm 


with b; > 0, i = 1,2,...,m; x; > 0, j = 1,2,...,n, and x4; > 0, i = 1,2,...,m. 


At the end, we may renumber the variables so that they are successively numbered. (For the 
above representation by хі, x2, ...,Xn+m, but in the following, we always assume that the 
problem includes n variables x1,x2,...,Xn after renumbering.) In the way described above, 
we can transform any given LPP formally into the standard form. (It is worth noting again that 
a solution is only feasible for the original problem if all artificial variables have value zero.) 


For illustrating the above transformation of an LPP into the standard form, we consider the 
following example. 


Example 9.4 Given is the following LPP: 


z = —x1 + 3x2 + x4 — min! 
s.t. xy — x + 3x3 — ха > 8 
X2 = 5x3 + 2x4 < -—4 
з + x < 3 
X2,X3,X4 > 0. 


First, we substitute for variable x, the difference of two non-negative variables xt and хї*, 
i.e. x] = xt — xD with xt > 0 and x3" > 0. Further, we multiply the objective function z 
by —1 and obtain: 


Z = —2 = Хү — Хү? — 3x2 — ха — max! 
s.t. xt — x = x) + 3x — ха > 8 
x. = 5x3 + 2x4 < -—4 
Ху: X4 € 3 
роу X4 > 0. 


Multiplying the second constraint by —1 and introducing the slack variable x7 in the third 
constraint as well as the surplus variables x5 and xg in the first and second constraints, we 
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obtain all constraints as equations with non-negative right-hand sides: 


2 = —7 = Хү — x5* — 3x2 — x4 — max! 
s.t. xt — xr — X2 + 3x3 — x4 — xs = 8 
= x2 + 5x3 — 2x4 — X6 — 4 
X3 + X4 + х7 3 
0. 


ж жж 
Xp > X2, X3, X4, X5, X6, X7 = 


Now we can choose variable хт as eliminated variable in the first constraint and variable x7 
as the eliminated variable in the third constraint, but there is no variable that occurs only in 
the second constraint having coefficient +1. Therefore, we introduce the artificial variable 
x41 in the second constraint and obtain: 


2 = =z = Хү — x5* — 3x2 — x4 —> max! 
st = xi* — х) + 3x3 — x4— x5 + хү = 8 
— X2 + 5х3 — 2x4 — X6 + XA1 =4 
X3 + x4 +x7=3 


X13] 5,230,33, X4, X5, X6, X7, X41 = 0. 


Notice that we have written the variables in such a way that the identity submatrix (column 
vectors of variables хү, x41, x7) occurs at the end. So in the standard form, the problem has 
now n — 9 variables. A vector satisfying all constraints is only a feasible solution for the 
original problem if the artificial variable x4; has value zero (otherwise the original second 
constraint would be violated). 


9.4 SIMPLEX ALGORITHM 


In this section, we always assume that the basic feasible solution resulting from the standard 
form of an LPP is feasible for the original problem. (In particular, we assume that no artificial 
variables are necessary to transform the given LPP into the standard form.) We now discuss 
a general method for solving linear programming problems, namely the simplex method. The 
basic idea of this approach is as follows. Starting with some initial extreme point (represented 
by a basic feasible solution resulting from the standard form of an LPP), we compute the 
value ofthe objective function and check whether the latter can be improved upon by moving 
to an adjacent extreme point (by applying the pivoting procedure). If so, we perform this 
move to the next extreme point and seek then whether further improvement is possible by a 
subsequent move. When finally an extreme point is attained that does not admit any further 
improvement, it will constitute an optimal solution. Thus, the simplex method is an iterative 
procedure which ends after a finite number of pivoting steps in an optimal extreme point 
(provided it is possible to move in each step to an adjacent extreme point). This idea is 
illustrated in Figure 9.6 for the case z — max! Starting from extreme point P1, one can go 
via points Р» and P5 to the optimal extreme point Рд or via Pa, P5, P4 to Рд. In both cases, 
the objective function value increases from extreme point to extreme point. 


In order to apply such an approach, a criterion to decide whether a move to an adjacent 
extreme point improves the objective function value is required, which we will derive in 
the following. We have already introduced the canonical form of a system of equations in 
Chapter 8.1.3 (see Definition 8.7). In the following, we assume that the rank of matrix A is 
equal to m, i.e. in the canonical form there are m basic variables among the n variables, and 
the number of non-basic variables is equal to n' 2 n — m. Consider a feasible canonical form 
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with the basic variables xg; and the non-basic variables ху: 


n 
xpi = b — у) ` аху, i=1,2,...,m (и = n— m). (9.9) 
ja 


Partitioning the set of variables into basic and non-basic variables, the objective function z 
can be written as follows: 


Zz = сіх +02X2 +... + CnXn 
= CB1XB1 + CB2XB2 +... + CBmXBm + CN1XN1 + CN2XN2 +... + См Хун! 
ГА 
т п 


У `свихві + У `сухм. 


i=1 j=l 


Using equations (9.9), we can replace the basic variables and write the objective function 
only in dependence on the non-basic variables. We obtain 


N 
Il || 
Me iM 
Ed S 
© 
| Sx 
| 
М... 
жогы] 
Mes EX 
о E 
= 
+ 
aat 
E = 
E = 


We refer to the latter row, where the objective function is written in terms of the current 
non-basic variables, as the objective row. Moreover, we define the following values: 


m 

gj = Qu cpid; — сму (coefficient of variable xy; in the objective row); (9.10) 
i=1 
m 

20 = у, Cpib; (value of the objective function of the basic solution). (9.11) 
i=1 


z — max! 


2 = constant 


Figure 9.6 Illustration of the simplex method. 
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Table 9.3 Simplex tableau of the basic feasible solution (9.9) 


No. nbv xn, XN2 сз XN co^ XW 
bw -1 см см sc ем сс суу 0 Q 
хві сві ајр а ay ay 0] 
XB) CB а ауу d а 5 
XBk с а а„ c5 gr р 
ХВт СВт Әр р с а сс Әр Dy, 
2 g 92 91 gw — Z0 


Concerning the calculation of value zo according to formula (9.11), we remember that in a 
basic solution, all non-basic variables are equal to zero. 


Then we get the following representation of the objective function in dependence on the 
non-basic variables xy: 


Zz = 20 — g1XN1 92XN2 m Ün'XNn'- 


Here each coefficient g; gives the change in the objective function value if the non-basic 
variable xy; is included in the set of basic variables (replacing some other basic variable) and 
if its value would increase by one unit. By means of the coefficients in the objective row, we 
can give the following optimality criterion. 


THEOREM 9.3 (optimality criterion) If inequalities g; > 0, j = 1,2,...,n', hold for the 
coefficients of the non-basic variables in the objective row, the corresponding basic feasible 
solution is optimal. 


From Theorem 9.3 we get the following obvious corollary. 


COROLLARY 9.1 If there exists a column / with фу < 0 in a basic feasible solution, the 
value of the objective function can be increased by including the column vector belonging 
to the non-basic variable xy; into the set of basis vectors, i.e. variable xy; becomes a basic 
variable in the subsequent basic feasible solution. 


Assume that we have some current basic feasible solution (9.9). The corresponding simplex 
tableau is given in Table 9.3. It corresponds to the short form of the tableau of the pivoting 
procedure as introduced in Chapter 8.2.3 when solving systems of linear inequalities. An 
additional row contains the coefficients g; together with the objective function value zo 
(i.e. the objective row) calculated as given above. 


In the second row and column ofthe simplex tableau in Table 9.3, we write the coefficients of 
the corresponding variables in the objective function. Notice that the values of the objective 
row in the above tableau (i.e. the coefficients g; and the objective function value zo) are 
obtained as scalar products of the vector (—1,cg1,cg2,...,¢€Bm)! of the second column 
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and the vector of the corresponding column (see formulas (9.10) and (9.11)). Therefore, 
numbers —1 and 0 in the second row are fixed in each tableau. In the left upper box, we 
write the number of the tableau. Note that the basic feasible solution (9.9) represented by 
Table 9.3 is not necessarily the initial basic feasible solution resulting from the standard form 


(in the latter case, we would simply have аў = aj and b? = bj fori = 1,2,...,m and 
j= 1,2,...,п provided that A = (aj) is the matrix formed by the column vectors belonging 


to the initial non-basic variables). 


If in the simplex tableau given in Table 9.3 at least one of the coefficients фу is negative, 
we have to perform a further pivoting step, i.e. we interchange one basic variable with a 
non-basic variable. This may be done as follows, where first the pivot column is determined, 
then the pivot row, and in this way the pivot element is obtained. 


Determination of the pivot column 1 


Choose some column /, 1 < / < п’, such that д; < 0. Often, a column / is used with 
gi = min(g; | g; < 0, j = 1,2,..., m). 


Itis worth noting that the selection of the smallest negative coefficient g; does not guarantee 
that the algorithm terminates after the smallest possible number of iterations. It guarantees 
only that there is the biggest increase in the objective function value when going towards the 
resulting subsequent extreme point. 


Determination of the pivot row k 


We recall that after the pivoting step, the feasibility of the basic solution must be maintained. 
Therefore, we choose row k with 1 < k « m such that 


br bt 
ko vus i 
qe c a 


ап il 


аў > 0, =. 


It is the same selection rule which we have already used for the solution of systems of linear 
inequalities (see Chapter 8.2.3). To determine the above quotients, we have added the last 
column Q in the tableau given in Table 9.3, where we enter the quotient in each row in which 
the corresponding element in the chosen pivot column is positive. 


If column / is chosen as pivot column, the corresponding variable xy; becomes a basic 
variable in the next step. We also say that xy; is the entering variable, and the column of the 
initial matrix A belonging to variable xy; is entering the basis. Using row K as pivot row, the 
corresponding variable xg; becomes а non-basic variable in the next step. In this case, we say 
that xpy is the leaving variable, and the column vector of matrix A belonging to variable xy; 
is leaving the basis. Element аў, is known as the pivot or pivot element. It has been printed 
in bold face in the tableau together with the leaving and the entering variables. 


The following two theorems characterize situations when either an optimal solution does not 
exist or when an existing optimal solution is not uniquely determined. 


THEOREM 9.4 If inequality g; < 0 holds for a coefficient of a non-basic variable in the 
objective row and inequalities a? < 0, i = 1,2,...,m, hold for the coefficients in column / 
of the current tableau, then the LPP does not have an optimal solution. 
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In the latter case, the objective function value is unbounded from above, and we can stop 
our computations. Although there is a negative coefficient gj, we cannot move to an adjacent 
extreme point with a better objective function value (there is no leaving variable which can 
be interchanged with the non-basic variable ху). 


THEOREM 9.5 If there exists a coefficient gj = 0 in the objective row of the tableau of 
an optimal basic feasible solution and inequality aù > 0 holds for at least one coefficient 
in column /, then there exists another optimal basic feasible solution, where xy; is a basic 
variable. 


If the assumptions of Theorem 9.5 are satisfied, we can perform a further pivoting step with 
хм as entering variable, and there is at least one basic variable which can be chosen as leaving 
variable. However, due to g; — 0, the objective function value does not change. Based on 
the results above, we can summarize the simplex algorithm as follows. 


Simplex algorithm 


(1) Transform the LPP into the standard form, where the constraints are given in canonical 
form as follows (remember that it is assumed that no artificial variables are necessary to 
transform the given problem into standard form): 


ANXN + Ixg = b, xy = 0, хв > 0, b — 0, 


where Ay = (aj) is of order m x n' and b = (bj, b5,.. ., bm)". The initial basic feasible 
solution is 


-(Ш-(@) 


with the objective function value 20 = c! x. Establish the corresponding initial tableau. 


(2) Consider the coefficients g;, j = 1,2,...,n', of the non-basic variables xy; in the 
objective row. 
If gj > 0 forj = 1,2,...,n’, then the current basic feasible solution is optimal, stop. 


Otherwise, there is a coefficient g; < 0 in the objective row. 
(3) Determine column / with 


gı = min{gj; | gj < 0, j 52,2] 


as pivot column. 

(4) If aj < 0 fori = 1,2,...,m, then stop. (In this case, there does not exist an optimal 
solution of the problem.) Otherwise, there is at least one element aj; > 0. 

(5) Determine the pivot row k such that 


by | bi 
—— = min — 
аы ail 


aij > 0, 12... 
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(6) Interchange the basic variable xg, of row k with the non-basic variable xy; of column / 
and calculate the following values of the new tableau: 


sd 
kl T > 
akl 
aij by t feu 
ар = —; b% = —; j=1,2,...,n", jÆ l; 
akl akl 
ail 
ай=——; i= 1,2,...,т, i Æ k; 
akl 
ай ап 
„жшт ай; асат bk; 


gi 
g = -—; 
ак 
91 А 
Gj = 8 — сй; J=1,2,....0, 745 
2) = 20 — a by 


Consider the tableau obtained as a new starting solution and go to step 2. 


It is worth noting that the coefficients g* and the objective function value zj in the objective 
row of new tableau can also be obtained by means of formulas (9.10) and (9.11), respectively, 
using the new values ар and b. 


If in each pivoting step the objective function value improves, the simplex method certainly 
terminates after a finite number of pivoting steps. However, it is possible that the objective 
function value does not change after a pivoting step. Assume that, when determining the pivot 
row, the minimal quotient is equal to zero. This means that some component of the current 
right-hand side vector is equal to zero. (This always happens when in the previous tableau 
the minimal quotient was not uniquely defined.) In this case, the basic variable xg; has the 
value zero, and in the next pivoting step, one non-basic variable becomes a basic variable 
again with value zero. Geometrically, this means that we do not move to an adjacent extreme 
point in this pivoting step: there is only one non-basic variable interchanged with some basic 
variable having value zero, and so it may happen that after a finite number of such pivoting 
steps with unchanged objective function value, we come again to a basic feasible solution 
that has already been visited. So, a cycle occurs and the procedure would not stop after a 
finite number of steps. We only mention that there exist several rules for selecting the pivot 
row and column which prevent such a cycling. One of these rules can be given as follows. 


Smallest subscript rule. If there are several candidates for the entering and/or leaving 
variables, always choose the corresponding variables having the smallest subscript. 


The above rule, which is due to Bland, means that among the non-basic variables with negative 
coefficient g; in the objective row, the variable with the smallest subscript is taken as the 
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entering variable and if, in this case, the same smallest quotient is obtained for several rows, 
then among the corresponding basic variables again the variable with the smallest subscript 
is chosen. However, since cycling occurs rather seldom in practice, the remote possibility of 
cycling is disregarded in most computer implementations of the simplex algorithm. 


Example 9.5 Let us consider again the data given in Example 9.2. We can immediately 
give the tableau for the initial basic feasible solution: 


1 nw xj Xx 


b -1 1 2 x4 


X3 0 0 8 640 80 

х4 0 6 6 720 120 

X5 0 6 3 600 200 
—1 —2 0 


Choosing х2 as entering variable, we get the quotients given in the last column of the above 
tableau and thus we choose x3 as leaving variable. (Hereafter, both the entering and leaving 
variables are printed in bold face.) Number 8 (also printed in bold face) becomes the pivot 
element, and we get the following new tableau: 


x2 0 1 80 — 
х4 6 -} 240 40 
xs 6 -$ 360 60 
1 
-1 $ 160 


Now, the entering variable x; is uniquely determined, and from the quotient column we find 
x4 as the leaving variable. Then we obtain the following new tableau: 


» 2 0 i 8 
x t -i 40 
xs 0 -1 {$ 120 
1 1 
l 4 200 


From the last tableau, we get the optimal solution which has already been found in 
Example 9.2 geometrically, i.e. the basic variables x,,x2. and xs are equal to the corre- 
sponding values of the right-hand side: х = 40, x? = 80, xs = 120, and the non-basic 
variables x3 and x4 are equal to zero. We also see that the simplex method moves in each step 
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from an extreme point to an adjacent extreme point. Referring to Figure 9.2, the algorithm 
starts at point P1, moves then to point Ps and finally to the optimal extreme point Рд. If in the 
first tableau variable x were chosen as the entering variable (which would also be allowed 
since the corresponding coefficient in the objective row is negative), the resulting path would 
be P1, P2, Рз, Рд, and in the latter case three pivoting steps would be necessary. 


Example 9.6 А firm intends to manufacture three types of products Р, P2 and Р» so that 
the total production cost does not exceed 32,000 EUR. There are 420 working hours possible 
and 30 units of raw materials may be used. Additionally, the data presented in Table 9.4 are 
given. 


Table 9.4 Data for Example 9.6 


Product Py Р») P3 
Selling price (EUR/piece) 1,600 3,000 5,200 
Production cost (EUR/piece) 1,000 2,000 4,000 
Required raw material (per piece) 3 2 2 
Working time (hours per piece) 20 10 20 


The objective is to determine the quantities of each product so that the profit is maximized. 
Let x; be the number of produced pieces of Pj, i є (1,2, 3). We can formulate the above 
problem as an LPP as follows: 


z = 6x; + 10x2 + 12x3 — max! 


s.t. xj + 2x2 + 4x3 < 32 
3x1 + 2x2 + 2x3 < 30 

2x, + x + 2x3 < 42 
X1, X2, X3 > 0. 


The objective function has been obtained by subtracting the production cost from the selling 
price and dividing the resulting profit by 100 for each product. Moreover, the constraint 
on the production cost has been divided by 1,000, and the constraint on the working time 
by 10. 

Introducing now in the ith constraint the slack variable x34; > 0, we obtain the standard form 
together with the following initial tableau: 


X4 0 1 2 4 32 8 
X5 0 3 2 2 30 15 
X6 0 2 1 2 42 21 
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Choosing x3 now as the entering variable (since it has the smallest negative coefficient in the 
objective row), variable x4 becomes the leaving variable due to the quotient rule. We obtain: 


1 1 1 
x 12} 4 1 8 16 
х5 $ 1 -} 14 14 
X6 3 0 -i 26 - 
-3 -4 3 96 


x; 12 -1 -4 4 1| 
x; 10 3 1 -4 14 
x; 0 3 0 -4 26 

7 4 1 132 


Since now all coefficients g; are positive, we get the following optimal solution from the 
latter tableau: 


х= 0, x2—14, x3—1, x4—0, x5=0, xg = 26. 
This means that the optimal solution is to produce no piece of product P1, 14 pieces of product 


Р» and one piece of product Рз. Taking into account that the coefficients of the objective 
function were divided by 100, we get a total profit of 15,200 EUR. 


Example 9.7 We consider the following LPP: 


z = —2x, — 2x? — min! 
s.t. xy — x» > -1 

—x| + 2x? < 
xX1,xX2 > 0. 


First, we transform the given problem into the standard form, i.e. we multiply the objective 
function and the first constraint by — 1 and introduce the slack variables хз and хд. We obtain: 


Z = 2х1 + 2x? — max! 
St. == + X2 + x = 1 
-x + 2x2 +. 24 - = 

X1, X2, X3, X4 > 0. 


348 Linear programming 


Now we can establish the first tableau: 


1 nbv X1 x2 
bw -1 2 200g 
з O -1 1 1] 1 
x 0 -1 2 4 

—2 —2 0 


Since there are only negative elements in the column of variable x1, only variable x? can be 
the entering variable. In this case, we get the quotients given in the last column of the latter 
tableau and therefore, variable x3 is the leaving variable. We obtain the following tableau: 


2 nbv X1 X3 


bw -1 2 000g 

gus 2-1 11 - 

xc 0 1-2 2 2 
-4 22 


In the latter tableau, there is only one negative coefficient of a non-basic variable in the 
objective row, therefore variable x; becomes the entering variable. Since there is only one 
positive element in the column belonging to ху, variable x4 becomes the leaving variable. 
We obtain the following tableau: 


3 nbw x4 хз 
bv -1 0 0 оо 
X2 2 1 -1 3 
x] 2 1-2 2 
—6 10 


Since there is only one negative coefficient of a non-basic variable in the objective row, 
variable x3 should be chosen as entering variable. However, there are only negative elements 
in the column belonging to x3. This means that we cannot perform a further pivoting step, 
and so there does not exist an optimal solution of the maximization problem considered (i.e. 
the objective function value can become arbitrarily large, see Theorem 9.4). 


Example 9.8 Given is the following LPP: 


Zz = X| + X2 + x3 + x4 + x5 + x6 — min! 


s.t. 2x1 +x2+x3 > 4,000 
x2 +2x4+ Xs > 5,000 
X3 + 2x5 + 3x6 > 3,000 


X1,X2,X3, X4, X5, X6 2 0. 


To get the standard form, we notice that in each constraint there is one variable that occurs 
only in this constraint. (Variable x; occurs only in the first constraint, variable x4 only in 
the second constraint and variable xg only in the third constraint.) Therefore, we divide the 
first constraint by the coefficient 2 of variable x1, the second constraint by 2 and the third 
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constraint by 3. Then, we introduce a surplus variable in each of the constraints, multiply the 
objective function by —1 and obtain the standard form. (Again the variables are written in 
such a way that the identity submatrix of the coefficient matrix occurs now at the end.) 


Z = —Z = —xX] — X2 — X3 — Х4 — Xs — xg — max! 
st — 5x2 + 4x3 = x7 +x] = 2,000 
152 + 5X5 — Xg -FX4 —2,500 
5x3 + 2х5 — X9 +x6 = 1,000 


X1,X2,X3, X4, X5, X6, X7, X8, X9 2 0. 


This yields the following initial tableau: 


bw —1 —1 -1 1 0 0 0 0 Q 
x -l 1 id 0-100 200 - 
x -1 1 o 4 o-1 0 250 5,000 
x; -1 0 4 0 0-1 1,000 1,500 
1 1 
o i1 - 1 1 1 -550 


Choosing now xs as entering variable, we obtain the quotients given in the last column of the 
above tableau and therefore, xg is chosen as leaving variable. We obtain the following tableau: 


x -l 4 4 0 -1 0 0 2,000 
1 1 3 3 

x -1 1 “4 ci 0 -1 4 1,750 

xs -l1 0 $ 3 0 0 -$ 1,500 
1 1 3 

0 1 l1 1 1 j —5,250 


Now all coefficients of the non-basic variables in the objective row are non-negative and 
from the latter tableau we obtain the following optimal solution: 


xı = 2,000, X2 = x3 = 0, x4 = 1,750, xs = 1,500, xg = 0 


with the optimal objective function value 200 = —5, 250 which corresponds to zm = 5,250 
(for the original minimization problem). Notice that the optimal solution is not uniquely 
determined. In the last tableau, there is one coefficient in the objective row equal to zero. 
Taking x2 as the entering variable, the quotient rule determines x4 as the leaving variable, 
and the following basic feasible solution with the same objective function value is obtained: 


х= 250, x. = 3,500, x3=x4=0, x5= 1,500, х =0. 
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9.5 TWO-PHASE SIMPLEX ALGORITHM 


In this section, we discuss the case when artificial variables are necessary to transform a 
given problem into standard form. In such a case, we have to determine a basic solution 
feasible for the original problem that can also be done by applying the simplex algorithm. 
This procedure is called phase I of the simplex algorithm. It either constructs an initial basic 
feasible solution or recognizes that the given LPP does not have a feasible solution at all. If 
a feasible starting solution has been found, phase I of the simplex algorithm starts, which 
corresponds to the simplex algorithm described in Chapter 9.4. 


The introduction of artificial variables is necessary when at least one constraint is an equation 
with no eliminated variable that has coefficient 4-1, e.g. the constraints may have the following 
form: 


a11 Xi + a12X2 + +++ + ах = bi 
a21 x1 + d22X2 + +++ + аһхһ = b2 


ат1Х\ + dg2X2 + +++ + amnXn = bm 


x > 0, j= 1,2,...,n; bijz0,i-1,2,...,m. 


As discussed in step (5) of generating the standard form, we introduce an artificial variable 
x4j in each equation. Additionally we replace the original objective function z by an objective 
function z; minimizing the sum of all artificial variables (or equivalently, maximizing the 
negative sum of all artificial variables) since it is our goal that all artificial variables will 
get value zero to ensure feasibility for the original problem. This gives the following linear 
programming problem to be considered in phase I: 


Z] — —XA] XA2 sate XAm > max! 
s.t. ац x1 + 12х02 + ... + d1nXn + XA1 = 
a21 X1 + а22х2 + ... + A2nXn + x42 —b 
(9.12) 
ат1Х1 + am2X2 +... + AmnXn + хат = bm 
x 20,7=1,2,...,n and xg >0,i=1,2,...,m. 


The above problem is the standard form of an LPP with function z replaced by function 
zr. The objective function zy used in the first phase is also called the auxiliary objective 
function. When minimizing function zz, the smallest possible objective function value is 
equal to zero. It is attained when all artificial variables have value zero, i.e. the artificial 
variables have become non-basic variables, or possibly some artificial variable is still a 
basic variable but it has value zero. When an artificial variable becomes the leaving variable 
(and therefore has value zero in the next tableau), it never will be the entering variable again, 
and therefore this variable together with the corresponding column can be dropped in the new 
tableau. 
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Assume that we have determined an optimal solution of the auxiliary problem (9.12) by the 
simplex method, i.e. the procedure stops with g; > 0 for all coefficients of the non-basic 


variables in the objective row (for the auxiliary objection function zz). Then, at the end of 
phase Z, the following cases are possible: 


(1) Wehavezy'?* < 0. Then the initial problem does not have a feasible solution, i.e. M = Ø. 
(2) We have 27% = 0. Then one of the following cases occurs: 


(a) All artificial variables are non-basic variables. Then the basic solution obtained 
represents a feasible canonical form for the initial problem, and we can start with 
phase II of the simplex algorithm described in Chapter 9.4. 

(b) Among the basic variables, there is still an artificial variable in row k: xg, = x4; = 0 
(degeneration case). Then we have one of the two possibilities: 


(i) In the row belonging to the basic variable x4; = 0, all coefficients are also 
equal to zero. In this case, the corresponding equation is superfluous and can 
be omitted. 

(ii) In the row belonging to the basic variable x4; = 0, we have ат, 5 0 in the 
tableau according to Table 9.3 (with function zz) for at least one coefficient. 
Then we can choose aj; as pivot element and replace the artificial variable x4; 
by the non-basic variable хуу. 


We illustrate the two-phase simplex algorithm by the following three examples. 


Example 9.9 Given is the following LPP: 


2 = x, — 2х2 > max! 


s.t. x) + x < 4 
2x} = x > 1 
xix? > 0. 


We transform the given problem into standard form by introducing a surplus variable (x3) 
in the second constraint, a slack variable (x4) in the first constraint and an artificial variable 
(x41) in the second constraint. Now we replace the objective function z by the auxiliary 
function zz. Thus, in phase I of the simplex method, we consider the following LPP: 


ZI = —x4j — max! 

s.t. X) + x + х4 = 4 

2x1} = x2 — x + xq = 1 

X1,32,X3,X4,X41 > 0 

We start with the following tableau: 
1 nbv X1 x2 x3 

w -1 0 0 0 оро 
x4 0 1 1 0 44 
xa —1 2 -1 -1 14 
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Choosing x; as entering variable gives the quotients presented in the last column of the above 
tableau, and variable x4; becomes the leaving variable. This leads to the following tableau: 


= 
с 
| 
= 
| 
= 
© 
© 
© 
tO 


© N= NN 


Now phase I is finished, we drop variable хл and the corresponding column, use the original 
objective function and determine the coefficients g; of the objective row. This yields the 
following tableau: 


N 

* 
s 

S 
es 
+ 
N 
> 
ө 


© 
е 
| 
= 
| 
кю 
© 
© 
tO 


3 1 7 
x4 0 5 55 7 
1 1 1 
*1 no o 
3 _1 1 
2 2 2 


Due to the negative coefficient in the objective row, we choose x3 as the entering variable 
in the next step and variable x4 becomes the leaving variable. Then we obtain the following 
tableau 


Since all coefficients g; are non-negative, the obtained solution is optimal: x; = 4,x2 = 0. 
The introduced surplus variable x5 is equal to seven while the introduced slack variable x4 is 
equal to zero. The optimal objective function value is 20 = 4. The graphical solution of this 
problem is illustrated in Figure 9.7. We see from Figure 9.7 that the origin of the coordinate 
system with variables x; and x2 is not feasible since the second constraint is violated. This was 
the reason for introducing the artificial variable x4; which has initially the value one. After 
the first pivoting step, we get a feasible solution for the original problem which corresponds 
to extreme point P; in Figure 9.7. Now phase II ofthe simplex algorithm starts, and after the 
next pivoting step we reach an adjacent extreme point P? which corresponds to an optimal 
solution of the considered LPP. 


Let us consider another LPP assuming that the objective function changes now to 


Z = —x| + 3x? — min! 
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Can we easily decide whether the optimal solution for the former objective function is also 
optimal for the new one? We replace only the coefficients cı and c? of the objective func- 
tion in the last tableau (again for the maximization version of the problem), recompute the 
coefficients g; of the objective row and obtain the following tableau: 


3 nbv X2 X4 


a 
= 
un 
є_ № 
AN 


Since also in this case all coefficients g; in the objective row are non-negative, the solution 
Xj = 4, x? = 0 is optimal for Z = —x; + 3x; —> min as well with a function value 
Z0 ™ = —4. This can also be confirmed by drawing the objective function 2 in Figure 9.7. 


Figure 9.7 Graphical solution of Example 9.9. 


Example 9.10 We consider the data given in Example 9.1 and apply the two-phase simplex 
method. Transforming the given problem into standard form we obtain: 


Z = —23x, — 17x2 — 12x3 — max! 
s.t. XQ + X2 + x3 + ха = 100 
X3 + ха = 30 
ХІ + x3 + xs = 50 
ә + x3 + xg = 90 
X1,X2,X3,X4,X5,X6,X41 > 0 
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Starting with phase I of the simplex method, we replace function z by the auxiliary objective 
function 


ZI = —x4|j —9 max! 


We obtain the following initial tableau: 


bw -1 0 0 0 0 о 
xa -l 1 1 1 100 100 
xp 0 0 9 1 30 — 
xs 0 1 0 1 50 50 
xs 001 1 bir. i 

D NES NN —100 


Choosing x, as the entering variable, we get the quotients given above and select xs as the 
leaving variable. This leads to the following tableau: 


xa —1 cl 1 0 50 50 
x4 о0о о 0 1 30 — 
x] Qu 1 0 1 50 — 
х6 0 0 1 1 90 90 

1 -1 0 —50 


Now x» becomes the entering variable and the artificial variable x41 is the leaving variable. 
We get the following tableau, where the superfluous column belonging to x41 is dropped. 


x 0 -1 0 50 
x4 0 0 1 30 
xy 0 1 1 50 
xs 0 1 1 40 

о о 0 


Now, phase I is finished, апа we can consider the objective function 


Z = —25x, — 17x; — 12x3 —> max! 
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We recompute the coefficients in the objective row and obtain the following tableau: 


3*  nbv xs X3 


bv -1 0 -12 0 Q 
à; —17 -1 0 50 — 
x4 0 0 1 30 30 
xj —25 1 1 50 50 
xg 0o 1 1 40 40 


—8 —13 —2,100 


We choose x3 as entering variable and based on the quotients given in the last column, x4 is 
the leaving variable. After this pivoting step, we get the following tableau: 


4 nb xs x4 


bw -1 0 0 о Q 

xj —17 -1 0 50 — 

x3 cz 0 d 30 — 

x —25 1 -1 20 20 

X6 p» cds ccu] 10 10 
-8 13 —1,710 


We choose xs as entering variable and xg as leaving variable which gives the following 
tableau: 


bw -1 0 0 0 о 
о = 60 
x3 —12 30 
Ж 25 d 10 
xs OF = Е 10 
8 5 1,630 


This last tableau gives the following optimal solution: 
xı = 10, х= 60, x3—30, x4=0, x5=10, x6 =0 


with the objective function value gmin = 1, 630 for the minimization problem. 


Example 9.11 Consider the following LPP: 


z = x, + 2x2 — max! 


S.t. xj, — xj) > 1 
5x; — 2x2 < 3 
X1,x2 > 0. 
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Transforming the above problem into standard form, we obtain 


2 = x, + 2x2 max! 
St. x — X2 — X3 + х4 
5x; — 2x2 + ха = 
X1, X2, X3, X4, XA1 > 0. 


1 


This leads to the following starting tableau for phase I with the auxiliary objective function 
2] = —x41 — max! 


© 
е 
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nm 
o 
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© 
© 
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We choose now variable x; as entering variable which gives the leaving variable x4. This 
yields the following tableau: 


w -1 0 00 оо 
x 5 73 $ 
„о Pd oo d 
1 3 2 
$$ l -5 


So, we finish with case (1) described earlier, i.e. 27% < 0. Consequently, the above LPP 
does not have a feasible solution. In fact, in the final tableau, variable x41 is still positive (so 
the original constraint x — х2 — x3 = 1 is violated). For the considered problem, the empty 
feasible region is given in Figure 9.8. (It can be seen that there are no feasible solutions for 
this problem, which confirms our computations.) 


524 - 2x2 =3 


2] 29 = 1 


Figure 9.8 Empty feasible region М for Example 9.11. 
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Remark For many problems, it is possible to reduce the number of artificial variables to be 
considered in phase I. This is important since in any pivoting step, only one artificial variable 
is removed from the set of basic variables. Thus, in the case of introducing m artificial 
variables, at least m pivoting steps are required in phase I. Let us consider the following 
example. 


Example 9.12 Consider the following constraints of an LPP: 


2x) — хз — X4 + 25 
—2х1 + 2x3 — x4 + xs 
Xj — 2x — X4 + X5 


xı + X2 + X3 


IV IV. IV IV IV 
© чм чә мо eR 


Х1,Х2,Х3,Х4,Х5 


First, we introduce in each constraint a surplus variable and obtain 


2x2 — x3 —X4-- Xs — X6 = 1 

—2x1 + 2x3 — x4 + xs — X7 LE 
ху — 2x2 — x4 +55 — Xg zug 
xi x24 x3 —x9= 5 
X1,X2,X3,X4,X5,X6,X7,X8,X9 > 0 


Now, we still need an eliminated variable with coefficient +1 in each equation constraint. 
Instead of introducing an artificial variable in each of the constraints, we subtract all other 
constraints from the constraint with the largest right-hand side. In this example, we get the 
first three equivalent constraints by subtracting the corresponding constraint from the fourth 
constraint: 


X| — Xx + 2x3 + x4 — xs — ху + XG — 4 
3x, + ә — X3 + X4 — Xs — ху + х7 E 
3x2 + X3 + X4 — X5 — ху + xg 2 

xy + x2 + X3 — X9 = 5 
X1,X2, X3, X4, X5, X6, X7, X8, X9 > 0. 


Now we have to introduce only one artificial variable x41 in the last constraint in order to 
start with phase I of the simplex algorithm. 


9.6 DUALITY; COMPLEMENTARY SLACKNESS 
We consider the following LPP denoted now as primal problem (P): 


z = сТх — max! 


s.t. Ax<b (P) 
x> 0, 
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where x = (x1,x2,...,x4)] € IR". By means of matrix А and vectors c and b, we can define 
a dual problem (D) as follows: 


w = blu — min! 
st. ATu> е (D) 
и> 0, 


where u = (Up+1,Un42,---,Unim) € R”. Thus, the dual problem (D) is obtained by the 
following rules: 


(1) The coefficient matrix A of problem (P) is transposed. 

(2) The variables of the dual problem are denoted as из, 412, .. ., Ил+т, and they have 

to be non-negative. 

The vector b of the right-hand side of problem (P) is the vector of the objective function 

of problem (D). 

(4) The vector c of the objective function of the primal problem (P) is the vector of the 
right-hand side of the dual problem (D). 

(5) In all constraints of the dual problem (D), we have inequalities with the relation >. 

(6) The objective function of the dual problem (D) has to be minimized. 


G 


— 


By ‘dualizing’ problem (P), there is an assignment between the constraints of the primal 
problem (P) and the variables of the dual problem (D), and conversely, between the variables 
of problem (P) and the constraints of problem (D). Both problems can be described by the 
scheme given in Table 9.5 which has to be read row-wise for problem (P) and column-wise 
for problem (D). 


Table 9.5 Relationships between problems (P) and (D) 


X1 X2 ха Xn < 
Un+1 ai а12 Ue aln by 
Un+2 azı 422 hen an b; 
Un4-m Om] m2 эз атп bm 
in! 
z СІ c2 ee Cn max! 


For instance, the first constraint of problem (P) reads as 
aii + d12X2 +... + ах < b1 

while e.g. the second constraint of problem (D) reads as 
Q12Upg4-1 + A22Un42 +... F Am2Untm = C2. 


Notice that the variables are successively numbered, i.e. the n variables of the primal problem 
(P) are indexed by 1, 2,..., while the m variables of the dual problem (D) are indexed by 
n+1,n+2,...,n-+ m. We get the following relationships between problems (P) and (D). 


THEOREM 9.6 Let problem (D) be the dual problem of problem (P). Then the dual problem 
of problem (D) is problem (P). 
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THEOREM 9.7 Let x = (x1,x2,...,Xn)! be an arbitrary feasible solution of problem (P) 
and u = (Un41,Un+2,---,Untm) be an arbitrary feasible solution of problem (D). Then 


20 = cix x blu = wọ. 


From the latter theorem, it follows that the objective function value of some feasible solution 
for a maximization problem is always smaller than or equal to the function value of a feasible 
solution for the dual (minimization) problem. 


THEOREM 9.8 If one of the problems (P) or (D) has an optimal solution, then the other one 
also has an optimal solution, and both optimal solutions x* and u* have the same objective 
function value, i.e. 


T, min 
zy =ex = фа" = и)". 


From Theorem 9.8 it follows that, if we know a feasible solution for the primal problem and 
a feasible solution for the dual problem, and both solutions have the same function value, 
then they must be optimal for the corresponding problems. The following theorem treats the 
case when one of the problems (P) and (D) has feasible solutions but the other one has not. 


THEOREM 9.9 If one of the problems (P) or (D) has a feasible solution but no optimal 
solution, then the other one does not have a feasible solution at all. 


Next, we consider the following question. Can we find the optimal solution for the dual prob- 
lem immediately from the final tableau for the primal problem provided that both problems 
(P) and (D) have feasible solutions? 


We introduce slack variables into the constraints of problem (P) and surplus variables into 
the constraints of problem (D), and we obtain for the constraints: 


аі X1o- d12 X2 +: Бау Xn + Xn+1 =b, 
a21 X14- 422 X2 + <- +a2n Xn + Xn+2 =} Р 
: (P^) 
Qm] Х1-Е@т2 X2 + +++ dmn Xn + Xn4+m=bm 
—ui +411 Un+1 +421 Un42 +++ ml Ип+т = 01 
—u2 +412 Un +1 +422 Un42 F 7 7 Бањо Unt+m = C2 К 
d І (D^) 
—Ип-Е@\п Ип-+1-Е@2п Unt2 + +++ mn Untm Сп. 
Then we obtain the following relationship between the primal variables x;,j = 1,2,...,n4-m, 
and the dual variables uj,j = 1,2,..., + m, where the variables are numbered according 


to (P*) and (D*). 


THEOREM 9.10 Тһе coefficients of the non-basic variables x; in the objective row of the 
optimal tableau of problem (P) are equal to the optimal values of the corresponding variables 
uj of problem (D), and conversely: The optimal values of the basic variables of problem (P) 


360 Linear programming 


are equal to the coefficients of the corresponding dual variables in the objective row of the 
optimal tableau of problem (P). 


Thus, from the optimal tableau of one of the problems (P) or (D), we can determine the 
optimal solution of the other one. 


Example 9.13 Consider the data given in Example 9.2. The optimal tableau for this problem 
has been determined in Example 9.5. Writing the dual problem with the constraints as in (D*), 
we obtain: 


w = 640из + 720u4 + 600u5 —> min! 
St. — ul + бид + би; 
= uw + 8u + бил + 3u5 = 
U1, U2, U3, Ид, US > 0. 


II 
= 


Applying Theorem 9.10, we get the following. Since in the optimal solution given in 
Example 9.5, variables x4 and x3 are non-basic variables, the corresponding variables u4 and 
u3 are basic variables in the optimal solution of the dual problem and their values are equal to 
the coefficients of x4 and x3 in the objective row of the optimal tableau of the primal problem, 
i.e. ид = 1/6 and из = 1/8. Since x2,xı and xs are basic variables in the optimal tableau 
of the primal problem, the variables из, и and us are non-basic variables in the optimal 
solution of the dual problem, i.e. their values are equal to zero. Accordingly, the values of 
the right-hand side in the optimal primal tableau correspond to the coefficients of the dual 
non-basic variables in the objective row, i.e. they are equal to 80, 40 and 120, respectively. 


Finally, we briefly deal with a ‘mixed’ LPP (i.e. equations may occur as constraints, we may 
have inequalities both with < and > sign, and variables are not necessarily non-negative). 
Then we can establish a dual problem using the rules given in Table 9.6. To illustrate, let us 
consider the following example. 


Example 9.14 Consider the LPP 


z = 2х1 + 3x2 + 4x3 —> max! 


s.t. X| — x2 + хз x 20 
—3x, + x. + хз > 3 

4р — x2 + 2x3 = 10 

X3 < 5 


xy > 0 (x2, хз arbitrary). 
Applying the rules given in Table 9.6, we obtain the following dual problem: 


w = 20u4 + Зи + 10ug + 5u7 —> min! 


s.t. ид — 3us + 4и > 42 
—u4 + Us — ug С. 
ид + us + 26 + ит = 4 


u4 > 0,u5 < 0, из > 0 (ug arbitrary). 
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Table 9.6 Rules for dualizing a problem 


Primal problem Dual problem 

(P) (D) 

Maximization problem Minimization problem 

Constraints Variables 

ith constraint: inequality with sign < = > uy4; > 0 

ith constraint: inequality with sign > = > ир; <0 

ith constraint: equation =>> ип arbitrary 

Variables Constraints 

xj = 0 = > jth constraint: inequality with sign > 
xj <0 = > jth constraint: inequality with sign < 
xj arbitrary => jth constraint: equation 


We finish this section with an economic interpretation of duality. Consider problem (P) 
as the problem of determining the optimal production quantities x; of product P;,j = 


1,2,...,n, subject to given budget constraints, i.e. for the production there are m raw 
materials required and the right-hand side value b; gives the available quantity of raw mate- 
rial Rj, i € {1,2,...,m}. The objective is to maximize the profit described by the linear 


function cT x. What is the economic meaning of the corresponding dual variables of problem 
(D) in this case? Assume that we increase the available amount of raw material R; by one 
unit (i.e. we use the right-hand side component b; = b; + 1) and all other values remain 
constant. Moreover, assume that there is an optimal non-degenerate basic solution (i.e. all 
basic variables have a value greater than zero) and that the same basic solution is optimal for 
the modified problem (P^) with b; replaced by b; (i.e. no further pivoting step is required). In 


this case, the optimal function value 20) of problem (P’) is given by 


ZN = biu* +... (i + и +... + Бих = Wo + usu 


where u* = (už +1, UM cone už ar denotes the optimal solution of the dual problem (D). 
The latter equality holds due to Theorem 9.8 (i.e. 20 = win), This means that after 
*buying' an additional unit of resource R;, the optimal function value increases by the value 
илы. Hence, the optimal values и , ; of the variables of the dual problem can be interpreted 
as shadow prices for buying an additional unit of raw materials R;, i.e. ия, ; characterizes the 


price that a company is willing to pay for an additional unit ofraw material R;, i = 1,2,...,m. 


Next, we consider the primal problem (P) and the dual problem (D). We can formulate the 
following relationship between the optimal solutions of both problems. 


THEOREM 9.11 Let x* = (x5,35,...,X; )T be a feasible solution of problem (P) and u* = 
(Ил, UPPER mu be a feasible solution of problem (D). Then both solutions x* and 
u* are simultaneously optimal if and only if the following two conditions hold: 


m 

(1) Уаш = су or хў 0 forj = 32. om 
= 
i 

(2: J ay = bi or ag = Ov fori=1,2,...,m. 
j=l 


362 Linear programming 


Theorem 9.11 means that vectors x* and u* are optimal for problems (P) and (D) if and 
only if 


(1) either the jth constraint in problem (D) is satisfied with equality or the jth variable of 
problem (P) is equal to zero (or both) and 
(2) either the ith constraint in problem (P) is satisfied with equality or the ith variable of 


problem (D) (i.e. и |; = 0) is equal to zero (or both). 


Theorem 9.11 can be rewritten in the following form: 


THEOREM 9.12 A feasible solution x* = (x7,x3,... Ed of problem (P) is optimal if 


and only if there exists a vector u* = (už "T и» me u% M such that the following two 
properties are satisfied: 
m 
К уа | 
(1) if x; > 0, then aa = с; (9.13) 


i=1 


n 
(2) if У арх? < bi, then u;,; = 0, 
j=l 


provided that vector u* is feasible for problem (D), i.e. 


m 
aiu > Cj forj — 1,2,...,m; (9 14) 
i=1 | 


иы 29 fori = 1,2,...,m. 
Remark Theorem 9.12 is often useful in checking the optimality of allegedly optimal solu- 
tions when no certificate of optimality is provided. Confronted with an allegedly optimal solu- 
tion x* of problem (P), we first set up the system of linear equations (9.13) and solve it for u* = 
(uS scores o. If the solution u* = (u7 ,,u7 5... uut is uniquely deter- 
mined, then vector x* is optimal if and only if vector и“ is feasible, i.e. conditions (9.14) hold. 


Example 9.15 Consider the LPP in Example 9.2 and the corresponding dual problem estab- 
lished in Example 9.13. Assume that we still do not know an optimal solution, but we want to 
use complementary slackness to verify that x} = 40, x; = 80 is indeed an optimal solution 
for problem (P). It follows from the second condition of (9.13) that už = 0 since, for the 
third constraint of the primal problem, we have 


6x} + 3х5 = 6-40 + 3 · 80 = 480 < 600 
while the other two constraints of ће primal problem are satisfied with equality, i.e. 
8x3 = 8.80 = 640 and бхї + 6х5 = 6:40 + 6 - 80 = 720. 
Establishing the system of linear equations resulting from the first part of (9.13), we get 
би = 1 
8u + би = 2. 
(Notice that хү = 40 > 0 and х5 = 80 > 0.) The latter system has the unique solution 


1 1 
ао and ua = е 
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which satisfies conditions (9.14). Therefore, without applying the simplex algorithm, we 
have confirmed that x = 40, х5 = 80 as well as из = 1/8, иң = 1/6, uš = 0 constitute 
optimal solutions for the primal and dual problems, respectively. 


9.7 DUAL SIMPLEX ALGORITHM 


Based on the relationships between problems (P) and (D), one can give an alternative variant 
of the simplex algorithm known as the dual simplex algorithm. While the simplex algorithm 
presented in Chapter 9.4 always works with (primal) feasible solutions and it stops when the 
current feasible solution is optimal (i.e. all coefficients g; in the objective row are greater 
than or equal to zero), the dual algorithm operates with infeasible solutions (i.e. there are 
right-hand side components smaller than zero), but the coefficients g; in the objective row 
are always all greater than or equal to zero (this means that the current solution satisfies the 
optimality criterion but it is not feasible). In the latter variant, all resulting basic solutions 
are feasible for the dual problem, but infeasible for the primal problem. 


In the dual simplex method, first the pivot row and then the pivot column are determined. 
Using the notations introduced in Chapter 9.4 (see description of the simplex algorithm), the 
pivot row and column are determined as follows provided that the simplex tableau of the 
current basic solution is as given in Table 9.3. 


Determination of the pivot row k 


Choose row k, 1 < k < m, such that b, < 0. Often, a row k is used with 


b% = min{bř [5 < 0, i=1,2,...,m}. 


Determination of the pivot column 1 


Choose column /, 1 < / < и’, such that 


zd — min EUN ар <0, j=1,2,...,n $. 
la; lag d 


Notice that, contrary to the (primal) simplex algorithm given in Chapter 9.4, the pivot element 
in the dual simplex algorithm is always negative. The determination of the pivot column as 
given above guarantees that after each pivoting step, all coefficients g; in the objective row 
remain greater than or equal to zero. In order to find the smallest quotient that defines the 
pivot column, we can add a Q row under the objective row (instead of the Q column in the 
primal simplex algorithm). The remaining transformation formulas are the same as described 
in Chapter 9.4, see step (6) of the simplex algorithm. The procedure stops when all right- 
hand sides are non-negative. The application of the dual simplex algorithm is particularly 
favourable when the given problem has the form 


z = cx — min! 


st. Ax > b 


х > 0). 
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We illustrate the dual simplex algorithm by the following example. 


Example 9.16 Let the following LPP be given: 


z = 2х1 + 3x2 + 3x3 —> min! 


s.t. xy + x — 3x3 > 6 
—x| + 22 + x > 2 

4x, + 3x2 — 2x3 > 3 
X1, X2, X3 > 0. 


We rewrite the problem as a maximization problem, multiply all constraints by —1 and 
introduce a slack variable in each constraint. This gives the following LPP: 


Z = —Z = —2ху — 3x2 — 3x3 —> max! 
s.t. -x — X2 + 3x3 + м = —6 
xy — 2x2 — X3 + xs = -—2 
—4x, — 3x. + 2x + x6 —3 


X1,X2,X3,X4,X5,X6 = 0. 


Now we can start with the dual simplex algorithm (notice that the application of the dual 
simplex algorithm does not now require the introduction of artificial variables as in the case 
of applying the primal simplex algorithm), and the initial tableau is as follows: 


] nhw xq x? xa 


bw -1 -2 -3 -3 0 


X4 0 -1 -1 3 -6 
х5 0 
х6 0 —4 -3 2 —3 


Q 2 3 = 


In the above tableau, all coefficients g; in the objective row are greater than or equal to 
zero (i.e. the optimality criterion is satisfied), but the current basic solution is infeasible 
(each basic variable is negative), which can be interpreted such that the current objective 
function value is ‘better’ than the optimal one. Applying the dual simplex algorithm, we 
first determine the pivot row. Choosing the smallest right-hand side component, x4 becomes 
the leaving variable, and based on the values in the quotient row, x; becomes the entering 
variable, which gives the pivot element —1. This leads to the following tableau: 


ы 
е 
о 
| 
A 
We Luo 
| 
= 
© 
кә 
ES 
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Choosing now xs as leaving variable, the only possible entering variable is x2 which gives 
the pivot element —3. We obtain the following tableau: 


> 
е 

| 
= 
= 
o 

| 
чө 
© 


2 1 7 10 
edu We Nc Ж 
x) —3 3.099 3 3 

11 1 28 55 
xs 0 -3 EE M. 

7 1 29 _44 

3 3 3 3 

Q 


Now, all right-hand sides are non-negative, and so the procedure stops. The optimal solution 
is as follows: 


10 8 55 


AS pedit X3 = X4 = xs = 0, x6 = су» 


EXERCISES 


9.1 Solve the following problems graphically: 


(a) z= х +x > min! (b) z= —x, + 4x? —> min! 
st. x + xn > 2 st. xy — 2x. < 4 
x — x > 3 —x| + 2x < 4 
xj + 22 x 6 xj + 2x < 8 
xj + 4% > 0 хх > 0 
хх) = 0 
(c) z= —x, — 2x2 — min! (d) z= х +x2 — max! 
st. —хд + x < 4 St x — x > O 
xj + 2x9 < 11 —xj — 2x x 4 
2x, + x) x 10 хх > 0 
ХІ < 4 
Х1,х2 = 0 


9.2 A craftsman has a free capacity of 200 working hours which he wants to use for making 
two products A and B. Production of one piece uses up | hour for A and 4 hours for B. 
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The number of pieces of product A is at most 100. The number of pieces of product B 
must be at least 30, but more than three times the amount of A is impossible. Finally, 
products A and B can be sold at prices of 20 EUR and 27 EUR but the variable costs 
incurred amount to 10 EUR and 21 EUR per piece of products A and B, respectively. 
What output combination should the craftsman choose in order to maximize the total 
profit? Find the solution of the problem graphically. 


9.3 Find the standard forms of the following linear programming problems: 


(a z= x1 — 2x2 + x3 — min! 


st. xj + x + x < 7 
3x, — xj + oux > 4 
X1,%2,%3 = 0 


(b) z= —x1 + 2x2 — 3x3 + x4 — min! 


st. 2x + 2x2 — хз + 3x4 = 8 
XI + 2x3 + x < 10 
—2x| + 2x3 — 3x4 > 0 


xı <0, 2,2320, x4€R 


9.4 (а) Find the optimal solution of the following problem graphically and by the simplex 
method: 


z = х1 + хә max! 


st 3x, + 2x. < 6 
x) + 40 < 4 
х0 = 0 


(b) Solve problem 9.1 (c) by the simplex method. 
9.5 Solve the following problems by the simplex method: 


(a) z= 7x; + 4x2 + 5x3 + 6x4 — max! 


s.t. 20x, + 10x2 + 12x3 + 16x, < 400 
X3 < 5 

xı + x?) + x3 + x4 < 30 
X1,X2,X3,X4 > 0 


(b) z= 2x, — 6x2 > min! 


st. 2x1 — x < 10 
хр — 3x2 < 15 

3x1 + хз = 12 
X1,%2,%3 > 0 


9.6 Solve problem 9.2 by the two-phase simplex algorithm. 
9.7 Solve the following linear programming problems by the simplex method: 


(a) z=x,+x2 – х3 — min! 


st. 3x) — x — 4x3 < 0 
xy + 2x2 > 10 

x + 3x3 > 4 

X1,X2,x%3 > 0 


9.8 


9.9 


9.10 


9.11 


(b z=x, + 2x2 + хз — max! 


st x + x < 10 
X2 + x4 < 14 
x] + x3 > 15 
X|,X2,X3 > 0 
(c) z= 2x, — x2 + x3 — max! 
St. xy + x — x — X4 
2x| + 3x2 — X3 — 2x4 


x2 + 2x3 
X1, X2, X3, X4 


Il 
© охо ь 


IV IV. Il 
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Formulate the dual problems of the three primal problems given in exercises 9.5 (a), 


9.7 (a) and 9.7 (c). 


Find the dual problems and both the primal and dual optimal solutions: 


(a) z= 2x, — 2x2 — x3 — x4 — min! 


st. —xp + 2x2 + лз — м 
X| — Хх + 0x4 
xy + 40 — 2x4 


X1,X2, X3, X4 
(b) z= 3x1 — x2 + x3 + x4 — max! 


st. 2x + x; — 3x4 — X4 
—x| + x + 3x — 2x4 
—X| + 2x3 + X4 


X1. X2, X3, X4 


IV IA IA IA 


IV IA IA IA 


4 
4 
4 
0 


or he 


A firm produces by means of three raw materials R1, R2 and Ёз two different products 
P, and P». Profits, raw material requirements and capacities are given in the following 


table: 
Pı P2 capacity 
Ri 2 4 16 
Ro 2 1 10 
R3 4 0 20 
profitperunitP; 2 3 


Use the method of complementary slackness to decide whether a production of four 


units of P, and two units of P» is optimal. 


A publicity agency needs at least 100 paper strips that are 1 m long and 5 cm wide, 
200 strips that are 1 m long and 3 cm wide and 400 strips that are 1 m long and 2 cm 
wide. The necessary strips can be cut from 1 m long and 10 cm wide strips. 


(a) Find all cutting variants without waste strips. 
(b) Formulate a linear programming problem, where the number of 10 cm wide strips 
needed is minimized. Denote by x; the number of 10 cm wide strips that are cut 


by variant i. 


(c) Solve the problem by the dual simplex algorithm. 


10 Eigenvalue problems апа 
quadratic forms 


In this chapter, we deal with an application of homogeneous systems of linear equations, 
namely eigenvalue problems. Moreover, we introduce so-called quadratic forms and inves- 
tigate their sign. Quadratic forms play an important role when determining extreme points 
and values of functions of several variables, which are discussed in Chapter 11. 


10.1 EIGENVALUES AND EIGENVECTORS 


An eigenvalue problem can be defined as follows. 


Definition 10.1 Let A be an n x n matrix. Then the scalar А is called an eigenvalue 
of matrix A if there exists a non-trivial solution x € К”, х Æ 0, of the matrix equation 


Ах = Ах. (10.1) 


The solution x" = (xj,x2,...,x4,) x (0,0,...,0) is called an eigenvector of А 


(associated with scalar à). 


Equation (10.1) is equivalent to a homogeneous linear system of equations. From the matrix 
equation 


AX = Ах = Мх 
we obtain 
(A — ADx = 0, (10.2) 


where J is the identity matrix of order n x n. Hence, system (10.2) includes n linear equations 
with n variables x1, x2, . . . , Xs. 


According to Definition 10.1, eigenvalue problems are defined only for square matrices A. 
In an eigenvalue problem, we look for all (real or complex) values А such that the image of 
a non-zero vector x given by a linear mapping described by matrix A is a multiple Ax of this 
vector x. It is worth noting that value zero is possible as an eigenvalue, while the zero vector 
is not possible as an eigenvector. Although eigenvalue problems arise mainly in engineering 
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sciences, they also have some importance in economics. For example, as we show in the 
following two chapters, they are useful for deciding whether a function has an extreme point 
or for solving certain types of differential and difference equations. 


The following theorem gives a necessary and sufficient condition for the existence of non- 
trivial (i.e. different from the zero vector) solutions of problem (10.1). 


THEOREM 10.1 Problem (10.1) has a non-trivial solution x zz 0 if and only if the 
determinant of matrix A — AJ is equal to zero, і.е. |4 — АД = 0. 


The validity of the above theorem can easily be seen by taking into account that a homoge- 
neous system (10.2) of linear equations has non-trivial solutions if and only if the rank of 
the coefficient matrix of the system (i.e. the rank of matrix A — АГ) is less than the number 
of variables. The latter condition is equivalent to the condition that the determinant of the 
coefficient matrix is equal to zero, which means that matrix A — AJ has no inverse matrix. 


We rewrite the determinant of matrix A — АГ as a function of the variable A. Letting P(A) = 
|A — АЦ, where A = (aj) is a matrix of order n x n, we get the following equation in A: 


ац = А a12 x ain 
a21 an = À ... an 
РО.) = ; ; | = 0, 
P» an2 e Ogg = А 


which is known as the characteristic equation (or eigenvalue equation) of matrix A. 


From the definition of a determinant, it follows that P(A) is a polynomial in A which has 
degree n for a matrix A of order n x n. The zeroes of this characteristic polynomial 


P(A) = (-1)"A" + b, АТ E + BA + bo 


of degree n are the eigenvalues of matrix A. Thus, in order to find all eigenvalues of an n x n 
matrix A, we have to determine all zeroes of polynomial P(A) of degree n (i.e. all roots ofthe 
characteristic equation P(A) = 0). Here we often have to apply numerical methods (described 
in Chapter 4) to find them (approximately). In general, the eigenvalues of a real matrix A 
can be complex numbers (and also the eigenvectors may contain complex components). The 
following theorem describes a case when all eigenvalues of matrix A are real. 


THEOREM 10.2 If matrix A of order n x n is a symmetric matrix (i.e. A = АТ), then all 
eigenvalues of A are real numbers. 


For each eigenvalue à; i = 1,2,...,n, we have to find the general solution of the 
homogeneous system of linear equations 


(A — Dx = 0 (10.3) 


in order to get the corresponding eigenvectors. Since the rank of matrix A is smaller than 
n, the solution of the corresponding system of equations is not uniquely determined and for 
each eigenvalue A;, i = 1,2,...,n, of system (10.3), there is indeed a solution where not all 
variables are equal to zero. 
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We continue with some properties of the set of eigenvectors belonging to the same eigenvalue. 


THEOREM 10.3 Let A be an eigenvalue of multiplicity k (i.e. A is k times a root of the 
characteristic equation P(A) = 0) of a matrix A of order n x n. Then: 


(1) The number of linearly independent eigenvectors associated with eigenvalue 4 is at least 
one and at most k. 

(2) IfA is a symmetric matrix, then there exist k linearly independent eigenvectors associated 
with A. The set of linearly independent eigenvectors associated with eigenvalue A forms 
a vector space. 


As a consequence of Theorem 10.3, we mention that, if x! and x? are eigenvectors associated 
with eigenvalue A, then also vector sx! + tx? with s є К and t € R is an eigenvector 
associated with A. It also follows from Theorem 10.3 that for an arbitrary square matrix A there 
always exists exactly one linearly independent eigenvector associated with an eigenvalue of 
multiplicity one. 


THEOREM 10.4 Let A be a matrix of order n x n. Then: 
(1) Eigenvectors associated with different eigenvalues of matrix A are linearly independent. 
(2) If matrix A is symmetric, then eigenvectors associated with different eigenvalues are 


orthogonal. 


Let us consider the following three examples to determine all eigenvalues and eigenvectors 
of a given matrix. 


Example 10.1 We determine the eigenvalues and eigenvectors of matrix 


The characteristic equation is given by 


1—2A 2 


PQ) = d= all =| e^ 


|-a-xd-3-4- X -n-ico 


with the solutions 
Ay=1ltv14+3=3 and Ag =1-vV14+3=-1. 


To determine the corresponding eigenvectors, we have to solve the matrix equation 
(A — AD)x = 0 for à = А and à = А2. Thus, we get the following system for A = А = 3: 


—2х + 2x2 = 0 
2x1 з= 2х2 = 0. 


The second equation may be obtained from the first equation by multiplying by —1 (i.e. both 
row vectors of the left-hand side are linearly dependent) and can therefore be dropped. The 
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coefficient matrix of the above system has rank one and so we can choose one variable 
arbitrarily, say x2 = t,t € IR. This yields x; = t, and each eigenvector associated with the 
eigenvalue A, = 3 can be described in the form 


sed іє К. 


Analogously, for A = А = —1, we get the system: 


2x1 + 2x2 = 0 
2х1 + 2x2 = 0. 
Again, we can drop one of the two identical equations and can choose one variable arbitrarily, 


say x? = s,s € R. Then we get x, = —s. Thus, all eigenvectors associated with Ал = —1 
can be represented as 


ce, seER. 


This example illustrates Theorem 10.4. For arbitrary choice of s, t € R, the eigenvectors x 
and x? are linearly independent and orthogonal (i.e. the scalar product of vectors x! and x? 
is equal to zero). 


1 


Example 10.2 We determine all eigenvalues and eigenvectors of matrix 


To find the eigenvalues, we consider the characteristic equation P(A) = 0: 


= EE —à -1 0 
Pà)-2|4-AH2|-7 -A 5 |2|-7 -A 5-Al=0. 
5 —2 5—ì 5 —2 3—ì 


The above transformation is obtained by adding columns 2 and 3 to get the third element 
equal to zero in row 1. Expanding the latter determinant by row 1, we obtain 


P(A) = |4- АЩ 2 —A- Ё 


= 47(3 — A) — 2A(5 — А) + [(-7) - (8 А) + 5(5 – А)] 
= (34? — 2° — 10A 227) + (—21 + 7A + 25 — 5)) 
= А3 + 5? — 8A + 4. 


Considering now the characteristic equation P(A) = 0, we try to find a first root and 
use Horner’s scheme (see Chapter 3.3.3) for the computation of the function value. 
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Checking A; = 1, we get 
—1 5 -8 4 


м =1 -1 4 —4 
|-1 4 -4 0 


i.e. Лу = 1 is a root of the characteristic equation P(A) = 0. From Horner's scheme (see last 
row), we obtain that dividing P(A) by the linear factor A — А = А — 1 gives the polynomial 
P(A) of degree two: 

Р›(А) = -А2 + 4X — 4. 
Setting P(A) = 0, we obtain 


àQ0224 44-422 and | 21322-— /4—4-2, 


Le. A2 = Аз = 2 is an eigenvalue of multiplicity two. 


In order to determine the eigenvectors associated with A; = 1, we get the homogeneous 
system of linear equations 


Se) = ә + хз =0 
—7x,; — ху + 5x3 =0 
—5xj — 2x? + 43 =0. 


Applying Gaussian elimination, we get the following tableaus: 


Row ху х хз b Operation 


1 -1 -1 1 0 

2 —7 —1 5 0 

3 —5 -2 4 0 

4 -1 -1 1 0 row 1 

5 0 6 —2 0 row2-7rowl 
6 0 3 —1 0 row3-Srowl 
7 -1 -1 1 0 row 4 

8 0 6 —2 0 row 5 

9 0 0 0 0 row6-}row5 


Since the rank of the coefficient matrix is equal to two, we can choose one variable arbitrarily. 
Setting хз = 3, we get x2 = 1 and xı = 2 (here we set хз = 3 in order to get integer solutions 
for the other two variables), i.e. each eigenvector associated with A; = 1 has the form 
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Considering the eigenvalue A? = Аз = 2, we get the following system of linear equations: 


—2x, — x2 + x3 = 0 
—7х\ — 2x. + 5x3 = 0 
—5x| — 2x. + 3x3 = 0. 


After applying Gaussian elimination or pivoting, we find that the coefficient matrix of this 
system of linear equations has rank two. Hence we can choose one variable arbitrarily. If we 
choose хз = 1, we get xo = —1 and x, = 1. Therefore, each eigenvector associated with 
eigenvalue Лә = Аз = 2 has the form 


In particular, for the latter eigenvalue of multiplicity two, there exists only one linearly 
independent eigenvector. 


Example 10.3 We determine the eigenvalues and eigenvectors of matrix 


-4 -3 3 
AL 2 3 o 
-1 -3 0 


The characteristic equation is given by 


4-0 -3 3 
Р) =|А-М|=| 2 3—A -6 
—1 -3 —ì 
—4- А 0 3 
=| 2 -3-A -6 
1 Gan wA 
—4— 0 3 
=| 3 0 —64AX 
1 3—2A А 
—4— 0 3 
=(-3-A)-| 3 0 —64A 
zx. тее 
—4— À 3 
22x ЫЗ —6 +À 


= —(-3—A)-[(-4—A)(-6+ A) — 9] 
(3+A)-(—A? + 2А + 15) = 0. 


In the transformations above, we have first added column 3 to column 2, and then we have 
added row 3 multiplied by —1 to row 2. In the next step, the term (—3 — A) has been factored 
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out from the second column and finally, the resulting determinant has been expanded by 
column 2. From equation 3 + A = 0, we obtain the first eigenvalue 


and from equation А2 + 2А + 15 = 0, we obtain the two eigenvalues 
Ал = —3 and Аз = 5. 


Next, we determine a maximum number of linearly independent eigenvectors for each eigen- 
value. We first consider the eigenvalue 4; = А = —3 of multiplicity two and obtain the 
following system of linear equations: 


—x1 — 3x2 + 3x3 = 0 
2х1 + 6x2 — бхз = 0 
—X4 — 3x2 + 3x3 = 0. 


Since equations one and three coincide and since equation two corresponds to equation one 
multiplied by —2, the rank of the coefficient matrix of the above system is equal to one, 
and therefore we can choose two variables arbitrarily. Consequently, there exist two linearly 
independent eigenvectors associated with this eigenvalue. Using our knowledge about the 
general solution of homogeneous systems of linear equations, we get linearly independent 
solutions by choosing for the first vector x! : xl = ld = 0 and for the second vector 
х2 : x2 = 0,x2 = ] (ie. we have taken x2 and x3 as the variables that can be chosen 
arbitrarily). Then the remaining variables are uniquely determined and we obtain e.g. from 


the first equation ofthe above system: H^ = —3, x? = 3. Therefore, the set of all eigenvectors 
associated with Az = Аз = —3 is given by 
—3 3 
xeR? | х= 1 +t] 0 |, seR,teEeR 
1 


While in Example 10.2 only one linearly independent vector was associated with the eigen- 
value of multiplicity two, in this example there are two linearly independent eigenvectors 
associated with an eigenvalue of multiplicity two. This is the maximal possible number since 
we know from Theorem 10.3 that at most k linearly independent eigenvectors are associ- 
ated with an eigenvalue of multiplicity k. To finish this example, we still have to find the 
eigenvalue associated with Аз = 5 by the solution of the following system of linear equations: 


—9x, — 3x2 + 3x3 = 0 
2x1 i= 2x2 = 6x3 =0 


—x| — 3x2 — 5x3 = 0. 


By applying Gaussian elimination or pivoting, we find that the coefficient matrix has rank 
two, and therefore one variable can be chosen arbitrarily. Choosing x3 = 1, we finally get 


X2 — —2xa = —2 апа X] — —3x2 = 5x3 = 
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Therefore, an eigenvector associated with Аз = 5 can be written in the form 


In the next chapters, we show how eigenvalues can be used for solving certain optimization 
problems as well as differential and difference equations. The problem of determining eigen- 
values and the corresponding eigenvectors often arises in economic problems dealing with 
processes of proportionate growth or decline. We demonstrate this by the following example. 


Example 10.4 Let xM be the number of men and x" the number of women in some 
population at time f. The relationship between the populations at some successive times 
t and t + 1 has been found to be as follows: 


хМ | = 0.8х + 0.4% 


х = 0.3х + 0.9%)". 


Letting x! = (M, a yr we obtain the following relationship between the populations x^*! 
and x’ at successive times: 


Sq & m х 
Ww = у | 
X 0.3 0.9 x) 
Moreover, we assume that the ratio of men and women is constant over time, i.e. 
M M 
Ww = у | 
х1 х; 


Now, the question is: do there exist such values А є R4 and vectors x’ satisfying the above 
equations, i.e. can we find numbers A and vectors x’ such that 


Ax! = Ах! ? 


To answer this question, we have to find the eigenvalues of matrix 
0.8 0.4 
im (03 25) 


and then for an appropriate eigenvalue the corresponding eigenvector. We obtain the 
characteristic equation 


0.8 — A 0.4 


03 0921 = Q8 —2X(0.9 — 2) - 0.3 0.4 = 0. 


PQ) = и) 
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This yields 


P(A) = А2 – 17А + 0.6 = 0 
and the eigenvalues 
А = 0.85 + /0.7225 — 0.6 = 0.85 + 0.35 = 1.2 and À2 = 0.85 — 0.35 = 0.5. 


Since we are looking for a proportionate growth in the population, only the eigenvalue greater 
than one is of interest, i.e. we have to consider Лу = 1.2 and determine the corresponding 
eigenvector from the system 


—04xM + 04x" =0 
03M — 03x" =0. 


The coefficient matrix has rank one, we can choose one variable arbitrarily and get the 
eigenvector 


In order to have a proportionate growth in the population, the initial population must consist 
of the same number of men and women, and the population grows by 20 per cent until 
the next time it is considered. This means that if initially at time t = 1, the population 
is given by vector x! = (1,000; 1,000)", then at time ¢ = 2 the population is given by 
x? = (1,200; 1,200)7, at time t = 3 the population is given by x? = (1,440; 1,440)T, and 
so on. 


10.2 QUADRATIC FORMS AND THEIR SIGN 


We start with the following definition. 


Definition 10.2 If 4 = (ajj) is a matrix of order n x n and x! = (x1,x2,... , Xn), then 
the term 


Q(x) = xl Ax (10.4) 


is called a quadratic form. 
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Writing equation (10.4) explicitly, we have: 


aj 12 Qin ХІ 

a» 422 an X2 
Q(x) = О(х\,хә,...,х„) = (1,22, ..., Xn) . 

Qn] An -... Ann Xn 


= aj1x1x1 + a12x1x2 + +++ + A1nX1Xn + 421X2X1 + a22x2X2 +++ + A2nX2Xn + ``: 


+ ani XnX1 + An2XnX2 + +++ + AnnXnXn 


n n 
- XXe 


i=1 j=1 


THEOREM 10.5 Let А be a matrix of order n x n. Then the quadratic form x! Ax can be 
written as a quadratic form хТА*х of a symmetric matrix A* of order n x n, i.e. we have 
xl Ax = x! A*x, where 


А* = ; (4 +47). 


As a result of Theorem 10.5, we can restrict ourselves to the consideration of quadratic forms 
of symmetric matrices, where all eigenvalues are real numbers (see Theorem 10.2). In the 
following definition, the sign of a quadratic form x! Ax is considered. 


Definition 10.3 A square matrix A of order n x n and its associated quadratic form 
Q(x) are said to be 


(1) positive definite if Q(x) = x' Ax > 0 forall х! = (x1,x2,...,x4) Æ (0,0,...,0); 
(2) positive semi-definite if Q(x) = x! Ax > 0 for all x є R^; 

(3) negative definite if Q(x) = x! Ax < OforallxT = (x1,x2,...,x4) Æ (0,0,...,0); 
(4) negative semi-definite if Q(x) = x! Ax < 0 for all x € R”; 

(5) indefinite if they are neither positive semi-definite nor negative semi-definite. 


The following example illustrates Definition 10.3. 


Example 10.5 Let 


se a 


We determine the sign of the quadratic form Q(x) = x1 Ax by applying Definition 10.3. Then 


ыер 1 (2) 


= xı (x1 — x2) + xY(—x1 + x2) 
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= х (xı — x2) — x2 (x1 — x2) 
= Qm — 0)? > 0. 
Therefore, matrix A is positive semi-definite. However, matrix A is not positive definite since 


there exist vectors ХГ = (x1,x2) Æ (0,0) such that Q(x) = хТАх = 0, namely if x1 = x2 
but is different from zero. 


The following theorem shows how we can decide by means of the eigenvalues of a symmetric 
matrix whether the matrix is positive or negative (semi-)definite. 


THEOREM 10.6 Let A be a symmetric matrix of order n x n with the eigenvalues 
Ат, À2, ..., Àn € R. Then: 


(1) A is positive definite if and only if all eigenvalues of A are positive (i.e. A; > 0 for 
і = 1,2,...,n). 

(2) Ais positive semi-definite if and only ifall eigenvalues of A are non-negative (i.e. A; > 0 
for i — 1,2,...,n). 

(3) A is negative definite if and only if all eigenvalues of A are negative (i.e. A; < 0 for 
i= 1,2,...,n). 

(4) Ais negative semi-definite if and only if all eigenvalues of A are non-positive (i.e. A; < 0 
for i — 1,2,...,n). 

(5) A is indefinite if and only if A has at least two eigenvalues with opposite signs. 


Example 10.6 Let us consider matrix A with 


22 (0.9. 


We determine the eigenvalues of A and obtain the characteristic equation 


—A 2 


PQ) = ~All =| ees 


|-o 
This yields 
SF раа SH 2S 0, 


The above quadratic equation has the solutions 


A= rz le Digs Slay and 
2 4 2- 2 

A2= : E s | з 2 < 0. 
2 4 2 2 


Since both eigenvalues have opposite signs, matrix A is indefinite according to part (5) of 
Theorem 10.6. 
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Next, we present another criterion to decide whether a given matrix А is positive or negative 
definite. To apply this criterion, we have to investigate the sign of certain minors introduced 
in the following definition. 


Definition 10.4 Тһе leading principal minors of matrix A = (aj) of order n x n are 
the determinants 


aj 412 aik 
a» 422 ak 

Dk coe . > k = 1,22, ,n, 
ар ар cc ак 


i.e. Dj is obtained from |A| by crossing out the last п — k columns and rows. 


By means of ће leading principal minors, we can give a criterion to decide whether a matrix 
A is positive or negative definite. 


THEOREM 10.7 Let matrix A be a symmetric matrix of order n x n with the leading principal 
minors Dj, k = 1,2,...,n. Then: 


(1) A is positive definite if and only if Dj > 0 fork = 1,2,...,n. 
(2) A is negative definite if and only if (-D- Dg > 0 for k = 1,2,...,n. 


For a symmetric matrix A of order n x n, we have to check the sign of n determinants to 
find out whether A is positive (negative) definite. If all leading principal minors are greater 
than zero, matrix A is positive definite according to part (1) of Theorem 10.7. If the signs of 
the n leading principal minors alternate, where the first minor is negative (i.e. element aj is 
smaller than zero), then matrix A is negative definite according to part (2) of Theorem 10.7. 
The following two examples illustrate the use of Theorem 10.7. 


Example 10.7 Let 


—3 2 0 
А= 2 =3 0 
0 0 —5 


For matrix A, we get the leading principal minors 


Dı =a, = —3 < 0, 


| ou ap |-3 25 /— mrs 
Pu ay P =| 2 E 4=5>0, 
ај а аз —3 2 0 
Рз = |а аә аз = | 2 -3 0| = –5р, = –25 <0. 


азі 432 азз 0 0 —5 
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Since the leading principal minors Dz, є {1,2,3}, alternate in sign, starting with a negative 
sign of Dj, matrix А is negative definite according to part (2) of Theorem 10.7. 


Example 10.8 We check for which values a € R the matrix 


2100 
1а 10 
А= |, 131 
0012 


is positive definite. We apply Theorem 10.7 and investigate for which values of a all leading 
principal minors are greater than zero. We obtain: 


Dı =2>0, 
Dic? ыу 00. 
l a 


The latter inequality holds for a > 1/2. Next, we obtain 


2 
Delt 
0 


210 
рл = |А\=—|1 a 0|+2Рз = —(2а— 1) + 12a — 10 = 10a — 9 > 0, 
011 


which holds for a > 9/10. Due to 1/2 < 5/6 < 9/10, and since all leading principal minors 
must be positive, matrix A is positive definite for a > 9/10. 


THEOREM 10.8 Let A bea symmetric matrix of order n x n. Then: 


(1) If matrix A is positive semi-definite, then each leading principal minor Dj, k = 
1,2,...,n, is non-negative. 


(2) If matrix A is negative semi-definite, then each leading principal minor D+ is either zero 
or has the same sign as (1 k = 1,2,...,n. 


It is worth noting that Theorem 10.8 is only a necessary condition for a positive (negative) 
semi-definite matrix A. If all leading principal minors of a matrix A are non-negative, we 
cannot conclude that this matrix must be positive semi-definite. We only note that, in order 
to get a sufficient condition for a positive (negative) semi-definite matrix, we have to check 
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all minors of matrix A and they have to satisfy the conditions of Theorem 10.8 concerning 
their signs. 


EXERCISES 


10.1 Given are the matrices 


—3 —2 4 2 0 0 
C= ]|-3 2 3 and D = 2 1 -2 
—2 —2 3 —1 0 2 


Find the eigenvalues and the eigenvectors of each of these matrices. 


10.2 Let x; be the consumption value of a national economy in period ¢ and y; the capital 
investment of the economy in this period. For the following period ¢ + 1, we have 


Xt] = 0.7x; + 0.6); 


which describes the change in consumption from one period to the subsequent one 
depending on consumption and capital investment in the current period. Consumption 
increases by 70 per cent of consumption and by 60 per cent of the capital investment. 
The capital investment follows the same type of strategy: 


Уу+1 = 0.6x; + 0.2y;. 
Thus, we have the system 


U1 = Au; with и, = (х,у), = 1,2,.... 


(a) Find the greatest eigenvalue à of the matrix A and the eigenvectors associated 
with this value. 

(b) Interpret the result above with A as a factor of proportionate growth. 

(c) Let 10,000 units be the sum of consumption value and capital investment in the 
first period. How does it have to be split for proportionate growth? Assume you 
have the same growth rate A for the following two periods, what are the values 
of consumption and capital investment? 


10.3 Given are the matrices 


^ з 2 ^ EN 

A= and B= {0 1 3 
00-2 0 031 
0 0 0 5 


(a) Find the eigenvalues and the eigenvectors of each of these matrices. 
(b) Verify that the eigenvectors of matrix A form a basis of the space IR^ and that the 
eigenvectors of matrix B are linearly independent and orthogonal. 
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10.4 


10.5 


10.6 


Verify that the quadratic form x! Bx with matrix B from Exercise 10.1 is positive 
definite. 


Given are the matrices: 


3 -1 2» 2 
de ade 


1 1 1 0 2 
C= = 5 0 and D= {0 1 0 
240 0 205 
(a) Find the eigenvalues of each of these matrices. 
(b) Determine by the given criterion (see Theorem 10.7) which of the matrices 
A, B, C, D (and their associated quadratic forms, respectively) are positive definite 
and which are negative definite. 


(c) Compare the results of (b) with the results of (a). 


Let x = (1,1,0)T be an eigenvector associated with the eigenvalue A; = 3 of the 
matrix 


wm 


ai 0 1 
A—-|2 a 0 
1 -1 аз 


(a) What can you conclude about the values of a1, аз and a3? 

(b) Find another eigenvector associated with A. 

(c) Is it possible in addition to the answers concerning part (a) to find further 
conditions for a1, аз and аз when А is positive definite? 

(d) If your answer is affirmative for part (с), do you see a way to find ај, аз and аз 
exactly when A? = —3 is also an eigenvalue of the matrix A? 


11 Functions of several variables 


In Chapter 4, we considered such situations as when a firm produces some output y = f (x) 
by means ofa certain input x. However, in many economic applications, we have to deal with 
situations where several variables have to be included in the mathematical model, e.g. usually 
the output depends on a set of different input factors. Therefore, we deal in this chapter with 
functions depending on more than one independent variable. 


11.4 PRELIMINARIES 


If we have a pair (х,у) є R? of two input factors, the output may be measured by a function 
value f (x, y) є R depending on two independent variables x and y. The notion of independent 
variables means that each variable can vary by itself without affecting the others. In a more 
general case, the input may be described by an n-tuple (x1, x2, ..., Xn) € IR" of numbers, and 
the output is described by a value f (x1,x2,. ..,x,) € R. In the former case, f : R? > Risa 
function of two variables mapping the pair (x, y) into a real number f (x, y). In the latter case, 
f: R” — Risa function ofn variables, or we also say that it is a function of an n-dimensional 
vector x. Instead of f(x1,x2,...,Xn), we also use the notation f(x) for the function value, 
where x denotes an n-vector. Similarly as in the case of a function of one variable, we denote 
the set of points for which the function value is defined as domain Dy of function f . Formally 
we can summarize: a function f : Dy > IR, Dy C R”, depending on n variables x1, x2, . . . , Xn 
assigns a specified real number to each n-vector (or point) (x1,x2,. .., x4) € Dy. (Since vector 
x can also be considered as a point, we always skip the transpose sign ‘T’ of a row vector in 
this chapter.) 


Example 11.1 (Cobb-Douglas production function) A production function assigns to 
each n-vector x with non-negative components, where the ith component represents the 
amount of input i (і = 1,2,...,n; such a vector is also known as an input bundle), the 
maximal output z = f'(x1,x2, ..., x4). Assume that an agricultural production function Q, 
where Q is the number of units produced, depends on three input variables K, L and R, where 
K stands for the capital invested, L denotes the labour input and R denotes the area of land that 
is used for the agricultural production. If the relationship between the independent variables 
K,L and R and the dependent variable (output) Q is described by the equation 


Q(K,L,R) = А. K” . L” . аз, 
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where A, a1, a2, a3 are given parameters, we say that О = O(K,L,R) is a Cobb—Douglas 
function. In this case, the output depends on the three variables K, L and R, but in general, it 
may depend on л variables, say K1, Ko,..., Kn. 


The graph of a function f with z = f(x,y), called a surface, consists of all points (x, y, 2) 
for which z = f(x,y) and (x, y) belongs to the domain Dy. The surface of a function f with 
ру © R? is illustrated in Figure 11.1. To get an overview оп а specific function, we might be 
interested in knowing all points (x, y) of the domain Dy which have the same function value. 
A level curve for function f depending on two variables is a curve in the xy plane given by 


2=/(х,у) = С, 


where C is a constant. This curve is also called an isoquant (indicating ‘equal quantity’). To 
illustrate, consider the following example. 


Surface z = f(x,y) 


SS Domain D; 


Y 


Figure 11.1 The surface z = f (x, y). 


Example 11.2 Let function f : Dy > R with 
fy) =42-y*, ру = ({(,у) ЄВ | -1<х51, -2<ух2), 


be given. We determine the isoquants for this function. Putting 4х2 — y? = С, where С 
denotes a constant, and eliminating variable y (and x, respectively), we get 


1 
у= +у4х2 – С ( or, correspondingly, cS J/» + c) } 
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For C = 0, we get the two lines y = 2x and y = —2x intersecting at point (0,0). For C z 0, 
each of the equations above gives two functions (namely one with sign ‘+’ and one with sign 
‘—’) provided that C is chosen such that the term under the square root is non-negative. The 
level curves for some values of C are given in Figure 11.2. 


L—- 


2 £ 


Figure 11.2 Level curves for function f with f (x,y) = 4x? — y?. 


Often, one wishes to determine to which value a function tends provided that the independent 


variables tend to a certain point x? = (at х0 Mom 3): In the following, we introduce the limit 


of a function of n independent variables as a generalization of the limit of a function of one 
variable introduced in Chapter 4.1.1. 


Definition 11.1 The real number L is called the тїї of the function f : Dy — R, 
Dy © R”, as x = (1,32,...,x4) tends to point х? = (ТУ; ae 140) if for any real 
number є > 0 there exists a real number ô = 8(є) > 0 such that 


If - L |< e, 
provided that 


li-a9|«8, kasi «3, |08. 


We can also write 


lim. f@= Г. 


XX 
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The limit exists only if L is a finite number. The left-hand side in the above notation means 
that the variables x; tend simultaneously and independently to x In the case of a function of 
one variable, there were only two possibilities in which a variable x might tend to a value xo, 
namely either from the right-hand side or from the left-hand side. Definition 11.1 requires 
that the limit has to exist for any possible path on which x approaches х0. 


A specific path in the case of a function f depending on two variables is e.g. considered if we 
first determine the limit as x tends to x°, and then the limit as y tends to у?, i.e. we determine 


Lı = lim lim. f (x, y). 


yoy? xx 


Accordingly, we can interchange the order in which both limits above are determined. These 
specific limits are called iterated limits. However, from the existence and equality of both 
iterated limits one cannot conclude that the limit of function f exists, as the following example 
shows. 


Example 11.3 Let function f : Dy > R with 


2x — 2y : 2 
foy) = (==) > Dy = (Gy) є Ҝ | y A =x}, 


and (х9, y?) = (0, 0) be given. For the iterated limits we get 


= 


Lı = lim lim ( 
x+y 


= lim (—2)* = lim 4 — 4; 
y0 x0 y0 


y0 


= lim 2? = lim 4 — 4. 


x0 x0 


220) 


L = lim lim ( 
x+y 


x0 y—0 


Although the iterated limits exist and are equal, the limit of function f as (x, y) tends to (0, 0) 
does not exist since e.g. along the path y = x, we get 


0 
foy) =f (x, x) = im 0 fo x40 
X 
and consequently 
Із = lim f(x,x) = 0, 
x0 


which is different from L; and Г». 


Next, we deal with the generalization of the concept of a continuous function to the case of 
functions of more than one variable. Roughly speaking, a function f of n variables is con- 
tinuous if small changes in the independent variables produce small changes in the function 
value. Formally we can introduce the following definition. 
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Definition 11.2 A function f : Dy > R, Dy С R”, is said to be continuous at point 
х9 = (aen Dae 232) € Dy if the limit of function f as x tends to x? exists and if this 
limit coincides with the function value f (х0), i.e. 


. 0 0 0 
Шт f (х1,х2,...,хп) =f 9, x5... .,x$). 
x>x? 


If function / is continuous at all points x? є Dy, then we also say that f is continuous. 


Analogously to functions of one variable (see Theorems 4.3 and 4.4 in Chapter 4.1.2), we 
get the following statement for functions of more than one variable: Any function of n > 1 
independent variables that can be constructed from continuous functions by combining the 
operations of addition, subtraction, multiplication, division and functional composition is 
continuous wherever it is defined. 


11.2 PARTIAL DERIVATIVES; GRADIENT 


In Chapter 4, we considered the first-order and higher-order derivatives of a function of 
one independent variable. In the following, we discuss how to find derivatives of a function 
depending on more than one variable. First, we assume that function f with z = f(x,y) 
depends on two independent variables x and y, and we introduce the notion of a partial 
derivative with respect to one of the two variables. 


Before proceeding, we introduce the notion of an e-neighbourhood U. (x?) of a point x? є 
Dy © К". We denote by Ue (x?) with e > 0 the set of all points x є IR" such that the Euclidean 
distance between the vectors x and x? is smaller than e, i.e. 


U.(xX)) = {x e R” 


In the case of n = 2 variables, the above definition of an e-neighbourhood of point (x°, y?) 
includes all points that are within a circle (but not on the boundary of the circle) with the 
centre at point (x?, y?) and the radius є. 


Definition 11.3 Let function f : Dy > R, Dp € R2, be defined in some neighbour- 
hood U. (x9, y?) of the point (x?, y?) є Dy. Then function f is said to be partially 
differentiable with respect to x at point (x°, y?) if the limit 

п LE ey - FP") 


li 
Ax—0 Ax 
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exists, and f is said to be partially differentiable with respect to y at the point (х0, y?) 
if the limit 
tm Те? + A» оу) 
im 
Ay—0 Ay 


exists. 
The above limits are denoted as 


Gey) resp. СИ 


апа are called partial derivatives (of the first order) of function f with respect to x and 
with respect to y at point (х9, y). 


We also write 


af (x9, 9) af x9, y?) 
MEM and E qi 
дх ду 


for the partial derivatives with respect to x and y at point (х0, у0). If for any point (х,у) € Dy 
the partial derivatives f; (x, y) and f,(x,y) exist, we get functions f; and f, by assigning to 
each point (x, y) the values f; (x, y) and f, (x, y), respectively. Finding the partial derivatives 
of function f with z = f (x,y) means: 


(1) to find f(x,y), one has to differentiate function f with respect to variable x while 
treating variable y as a constant, and 

(2) to find f(x,y), one has to differentiate function f with respect to variable y while 
treating variable x as a constant. 


The partial derivatives at some point (x9, y?) correspond again to slopes of certain straight 
lines. We can give the following interpretations. 


(1) Geometric interpretation of f.(x°,y°). The slope of the tangent line fr (х) to the curve of 
intersection of the surface z = f(x,y) and the plane у = y? at the point (х9, у?, 20) оп 
the surface is equal to f; (x?, у0). 

(2) Geometric interpretation of f, (x9, y°). The slope of the tangent line fr (y) to the curve of 
intersection of the surface z = f (x, y) and the plane x = x at the point (х0, y°,z°) on 
the surface is equal to dos y?) 


The partial derivatives are illustrated in Figure 11.3. The slopes of the lines fr(x) and fr(y) 
are given by the partial derivatives of function f with z = f (x, y) with respect to the variables 
x and y, respectively, at point (x9, y). Both tangent lines span a plane fT (x, y), which is called 
the tangent plane to function z = f (x, y) at point (x9, y°), and the equation of this tangent 
plane is given by 


frc y) = f (хо, yo) + (хо, yo) - (x — xo) + fy (хо, yo) - (У — yo). 
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Xo a 


Figure 11.3 Partial derivatives of function f with z = f (x, y) and the tangent plane fT (х, y). 


Example 11.4 Assume that the total cost C of a firm manufacturing two products A and В 
depends on the produced quantities. Let x be the number of produced units of product A, y 
be the number of produced units of product В and let the total cost C(x, y) be given as 


С(х,у) = 200 + 22x + 167, — Do = R2. 


The first term of function C represents the fixed cost and the other two terms represent the 
variable cost in dependence on the values x and y. We determine the partial derivatives with 
respect to x and y and obtain 


3 
CGy) 222 and Обу) = 5 + 16- VF 24 uy. 


C,(x,y) is known as the marginal cost of product A, and Су(х, у) as the marginal cost of 
product B. The partial derivative C,(x,y) = 22 can be interpreted such that, when the 
quantity of product B does not change, an increase of the production of product A by one unit 
leads to an approximate increase of 22 units in total cost (independent of the concrete value 
of variable x). When keeping the production of A constant, the marginal cost of product B 
depends on the particular value of variable y. For instance, if y — 16, then an increase of the 
production of B by one unit causes approximately an increase of C, (х, 16) = 24. М16 = 96 
units in total cost, whereas for y = 100 the increase in total cost is approximately equal to 
C, (x, 100) = 24- /100 = 240 units. 
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The previous considerations can be generalized to functions of more than two variables. 
In the following, we consider a function f with z = f (x1,x2,...,x4) depending оп the n 
independent variables x1,x2, . . . , Xn- 


Definition 11.4 Let function f: Dy > R, Dy С R”, be defined in some neighbour- 
hood С, (х0) of point x? = Ge хў s E ) € Dy. Then function f is said to be partially 
differentiable with respect to x; at point x if the limit 


0 0 0 0 0 0 {0 0 
mone CEA ОД dg rede a 
Axj—0 Ax; 


exists. The above limit is denoted by 
f (x) 


and is called the partial derivative (of the first order) of function f with respect to x; at 
point xo = (Хх КОР ЫШ). 


We also write 


af (9, х0, = a) 
OX; 


for the partial derivative of function f with respect to x; at point x°. The partial derivative 
f; (X9) gives approximately the change in the function value that results from an increase of 
variable x; by one unit (i.e. from x? to x? + 1) while holding all other variables constant. 
That is, partial differentiation of a function of n variables means that we must keep n — 1 
independent variables constant while allowing only one variable to change. Since we know 
already from Chapter 4 how to handle constants in the case of a function of one independent 
variable, it is not a problem to perform the partial differentiation. 


If function f is partially differentiable with respect to x; at any point x € Dy, then function f 
is said to be partially differentiable with respect to x;. If function f is partially differentiable 
with respect to all variables, f is said to be partially differentiable. If all partial derivatives 
are continuous, we say that function f is continuously partially differentiable. 


Example 11.5 We consider the cost function C : RÀ — R of a firm producing three 
products given by 


C(x1,32,x3) = 300 + 2x1 + 0.5x2 + J/xixa + хз, 


Functions of several variables 391 


where x; denotes the produced quantity of product i € {1,2,3}. For the partial derivatives, 
we obtain 


x2 


2./x1xX2 + x3 А 


ХІ 
2./x1xX2 + х3 А 
1 
2./x1x2 + X3 i 


Син) = 2 + 
Су, (x1,x2, x3) = X2 + 


Cy, (x1,x2, x3) = 


Next, we introduce higher-order partial derivatives. Note that the first-order partial 
derivatives of a function f : Dy — IR, Dy С К”, are in general again functions of n vari- 
ables x1,x2, . .., x4. Thus, we can again determine the partial derivatives of these functions 
describing the first-order partial derivatives and so on. To illustrate this, consider a function 
f withz = f(x,y) depending on n = 2 variables. We obtain: 


af (x, af (x, 
доу) = FEY = абу); FG y 
x ду 
af (х,у) дд(х,у) af (х,у) дү, у) 
fy) = = = (х,у); убу) = oe р by) 
af (х,у)  дд(х,у) Of(xy) dh(x,y) 
fy = y = = 6005 Лабу) = T a = бу). 


Here the notation f;, (x, y) means that we first differentiate function f with respect to variable 
x and then with respect to variable y, i.e. 


д (FEN) _ foy) 
ti xy (x > у ) = = ы 

ду дх дудх 
Analogously, we can consider mth-order partial derivatives that include m successive partial 
differentiations with respect to certain variables. If all partial derivatives of mth-order are 
continuous, we say that function f is m times continuously partially differentiable. 


Example 11.6 Let function f : R? — [—1, 1] with 
z = f(x, y) = sin(3x — y) 


be a function oftwo independent variables. We determine all second-order partial derivatives 
of function f and obtain: 


icy) = 3 cos(3x — y); fix(x,y) = —9sin(3x—y); Ду(х,у) = 3 sin(3x — у); 
Sy@,y) = – соѕ(3х — у); fix(x,y) = 3 sin(3x — у); Sy, y) = — sin(3x — y). 
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We observe that in Example 11.6 the mixed second-order partial derivatives fi, (x, y) and 
fx Gy) are identical. We now ask for a condition under which the latter statement is 
necessarily true. This question is answered by the following theorem. 


THEOREM 11.1 (Young's theorem) Let two mth-order partial derivatives of function 
f: Dy > R, Dy € R”, involve the same number of differentiations with respect to each 
of the variables and suppose that they are both continuous. Then the two partial derivatives 
are necessarily equal at all points x € Dy. 


If in Theorem 11.1 only a particular point x? 


€ Dy is considered, it suffices to assume 
that both mth-order partial derivatives are continuous only in some neighbourhood U. (x?) 
with є > 0 to ensure equality of both mth-order partial derivatives at point x?. Hereafter, 
for the sake of simplicity in the presentation, we always suppose in what follows that the 
corresponding assumptions on partial differentiability or continuous partial differentiability 


of a function f hold for all points of the domain Dy. 


A special case of Theorem 11.1 is obtained when n — m — 2. Then Young's theorem may be 
interpreted as follows: if functions f;, and fyx are continuous, then fey (x, y) = fy (x, y) holds 
at all points (x, y) € Dy. 


Definition 11.5 Let function f: Dy > IR, Dy С IR", be partially differentiable at 
point x? € Dy. Then the vector 


Ut (^) 
Berean lee H | 
P 


is called the gradient of function f at point x? є Dy. 


The vector grad f (x?) has an important geometric interpretation. It gives the direction in 
which function f at point x? є Dy increases most. 


Example 11.7 We consider function f : R? > R with 


1 
fG1,x2) = 2x? + 522 


and determine the gradient at point x? = on. x9 ) = (1,2). First, we find the partial derivatives 
f 01,32) = 4x1 and fy (1, x2) = x2 


which yields 


grad f(x1,x2) = ( TH ) 


x2 
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and thus 
4 
grad f(1,2) = 2] 
An illustration is given in Figure 11.4. Consider the level curve 
2,1, А 0 0 
Рот, хо) = 2x7 + 5% = с with c —f(xj,x5) = 4 
which is an ellipse 
2 2 
оао 
— + 2 = 1 
2 8 


with the half axes a = 4/2 and b = 4/8. Consider the tangent line to this level curve at 
point (1,2). From Figure 11.4 we see that the gradient at this point, i.e. vector (4, 2)", is 
orthogonal to the tangent line to the level curve at point Gia ) = (1,2). It can be shown 
that this property always holds. 


124 grad f(1,2) = (4,2) 


2x? + 23/2 = 4 


A 


Figure 11.4 The gradient of function f with f (x1, x2) = 2x? + х2/2 at point (39,3) = (1,2). 


Example 11.8 Suppose that the production function f : RŽ — R+ ofa firm is given by 


f(x,y) = Зх2у + 0.5хе?, 
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where x stands for the amount of labour used and y stands for the amount of capital used. We 
obtain the gradient 


_ ( oxy +0.5e” 
grad f(x,y) = ( 3x? + 0.5хе? ) 


If the current inputs are х0 = 10 and у? = In 12, then, in order to get the biggest increase in 
output, we have to determine the gradient at point (x°, y°), and we obtain 


0 o, / 60In12+6\ _ ( 155.09 
grad (6^5 = ( 300460 J^ 360 J’ 


So, to increase the output as much as possible, the firm should increase labour and capital 
approximately in the ratio 155.09 : 360, i.e. in the ratio 1:2.32. 


11.3 TOTAL DIFFERENTIAL 


Partial derivatives consider only changes in one variable where the remaining ones are not 
changed. Now we investigate how a function value changes in the case of simultaneous 
changes of the independent variables. This leads to the concept of the total differential 
introduced in the following definition. 


Definition 11.6 Let function f : Dy — R,Dy C R”, be continuously partially 
differentiable. Then the total differential of function f at point x? = GS dotem 0) 15 
defined as 


dz = fe (х0) dxi + fa (x9) dxa + +++ + f (x°) din. 


The terms fy; (х0) dx;, i= 1,2,...,n, are called partial differentials. 


The partial differential fy, (x°) dx; gives approximately the change in the function value when 
changing variable x; at point x? by Ax; = dx;, i € {1,2,...,n}. The total differential is 
illustrated in Figure 11.5 for the case n = 2. We consider the points x? and x? + Ax, where 
AX = (Ax, Ax2). Then the total differential gives the difference in the function value of 
the tangent plane at point x? + Ax and the function value of function f at point х0. If the 
changes Ax; and Ax» are small, then the total differential dz is approximately equal to Az, 
which gives the difference in the function values of function f at the points x? -- Ax and x?. 


In other words, in some neighbourhood of point х0, we can approximate function f by the 
tangent plane fr (x1, x2). 
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x9 + An, 21 


Figure 11.5 The total differential of function f with 2 = f(x, у). 


Example 11.9 We determine the total differential of function f : Dy — IR with 
x 
z = f(x,y) = Intan -. 
У 


For the first-order partial derivatives we get 


1 


y 1 2 
y 
Sey) = x 2x — X x 7 к ax? 
tan ў cos E ysin E cos 3 ysin F 
х 2 
2 =y —2x 
= y T e 
foy) = = = = 


tan * cos? Х 2 sin * cos Х 2 sin 2€ 
y y у y y y^ sin E 


In the above transformation, we have applied the addition theorem for the sine function (see 
property (1) of the trigonometric functions in Chapter 3.3.3). Then the total differential is 
given by 


2 dx 2x d 2 
dz = f(x,y) dx + f(x,y) dy = — TF - =- z -(y dx — x dy). 
узш узіп — y?sin F 
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Next, we consider an application of the total differential, namely the estimation of the maximal 
error in the value of a function of n independent variables. Assume that we have to determine 
the function value 2 = f(x1,x2,...,X,) at point x? = (х0, х0,...,х0), where some errors 
occur in the independent variables. The errors in these variables usually lead to an error Az 
in the objective function value z. The aim is to estimate the error Az provided that the errors 
Ax1, AX2, ... , Ax, in the variables x1, x2, .. ., x, are bounded by constants Aj, i.e. we have 
[Ах] < Aj. Since all errors Ax;,i = 1,2,..., т, are assumed to be sufficiently small, we 
can apply Definition 11.6 and, because Az ~ dz, we obtain 


| А2 | 5 |42 | = < У 00) [Axil < У Us: А. 


і=1 i=l 


У fa (x9) - Ах; 


i=1 


By means of the last formula, we have an estimation for the maximal absolute error Az in 
the function value at point х0 provided that the absolute error in the variable x; at point x? is 
bounded by a sufficiently small constant A;, i € {1,2,...,n}. 


Example 11.10 Assume that the radius R and the height H of a cylinder are measured with 
R = 5.05 + 0.01 cm and Н = 8.20 + 0.005 cm, i.e. Ro = 5.05 cm, Ну = 8.20 cm, 
|dR| < 0.01 cm and |dH| < 0.005 cm. We wish to estimate the absolute and relative errors 
of the volume V of the cylinder. We have 


Vo = V (Ro, Ho) = 1 Ri Ho = 656.97 ст?. 
For the total differential we get 


aV aV 
dV = — dR + — ан = 2x RoHo dR + n R2 dH. 
V oR 4 + 3H z Ro Ho + лКо 


Inserting Ro = 5.05, Но = 8.20, |dR| < 0.01 and |dH| < 0.005, we obtain the estimate 


|AV| ~ |dV| < |2z RoHo| - |dR| + |x R2] - |4H| 
< 82.827 - 0.01 + 25.50257 - 0.005 = 0.9557125z < 3.01 cm? 


(note that we have used the triangle inequality to get the estimate given above), i.e. 


Vo © 656.97 + 3.01 cm?. 


For the relative error we get 


dV 
Vo 


x | = 


< 0.005. 


AV 
Vo 


27x RoHo dR + xR =] РР E 


Ro Ho R H 
Thus, for the given maximum errors in the variables, we get the estimate that the relative 
error for the volume is less than 0.5 per cent. 
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11.4 GENERALIZED CHAIN RULE; DIRECTIONAL DERIVATIVES 


In Chapter 11.2, we investigated how a function value changes if one moves along one of 
the axes. The answer was obtained by means of the partial derivatives with respect to one 
of the variables. Now we investigate a more general question, namely how a function value 
changes if we move along an arbitrary curve C. First, we assume that this curve C is given 
in a parametric form 


x = x(t) and y = y(t). 


As an example, we mention the parametric representation of a function f with z(t) = 
f (x(t), y (t)), where the domain is a circle given by all points (x,y) € R? with 


x=rsint, y=rcost; t € [0,27], 
where r denotes the (constant) radius and t is the (variable) angle measured in radians. We 


only note that parameter ¢ often represents the time in mathematical models of economical 
problems, i.e. we describe the variables x and y in representation f (x, y) as functions of time t. 


A point (x°, y?) corresponds to some value 1? in the above parametric representation, i.e. 
x? = x(t?) and y? = y(t). 


Accordingly, for some point on the curve C with parameter value t = t° + At, we get the 
representation 


x = x(t? + At) and y= y(t? + At). 


Now we can conclude that, if At — 0, then also Ax — 0 and Ay — 0. Therefore, the 
derivative 


Hey) | jm LEC + Ad, yO + AD) — fa), YE) 


dt At>0 At 
— и © Ах? + A» — SO? у) 
At—0 At 


characterizes the approximate change of function f if parameter t increases from tọ by one 
unit, i.e. one moves along curve C from point (x(fo), y(to)) to point (x(to + 1), y(to + 1)) 
provided that the above limit exists. 


To obtain a formula for the derivative of such a function with variables depending on a 
parameter t, we consider the general case of п independent variables and assume that all 
variables x; , i = 1,2,...,n, are also functions of a variable f, i.e. 


xı = х@), x? = x2(t), T Xn = Xn(t). 
Thus, we have: 


Z = /(х1,х2,...,ха) = (100), х2(0),...,х.(0)) = 2(0). 
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To determine the derivative dz/dt, we have to compute 


dz(x1, x2, . m Xn) 


- lim foa + Ax1, x2 + Ax2, S me ‚Хп + Ах) — f (x1,%2, s, -> Xn) 
dt ^ At At 


$ 


provided that this limit exists, where Ax; = x;(t + At) — x; (t). This can be done by applying 
the following theorem. 


THEOREM 11.2 (chain rule) Letfunctionf : Dy > К, Dy С R”, be continuously partially 
differentiable, where 


X] — x1(t), X2 — x2(t), ee, Xn = Xn(t), 


and functions x;(t) : Dy, > R, i = 1,2,...,n, are differentiable with respect to t. Then 
function 


z(t) = f(x) = f G0), x20), .-- xn) 


is also differentiable with respect to t, and we get 
dx 
= fu c) 1 + Tro c) 22 ese p (x) ae 


The above result can be generalized to the case when the functions x; depend on two variables 
и and v. We get the following theorem. 


THEOREM 11.3 (generalized chain rule) Let function f : Dy — R, Dy С R”, be continu- 
ously partially differentiable, where 


X] — x1(u, v), X2 — x2 (и, v), e Xn = хп(и, 0), 


and functions х; (и, 0) : Dy, > R, Dy, © R2, i = 1,2,...,n, are continuously partially 
differentiable with respect to both variables u and v. Then function z with 


z(u,v) = f (xı (u, v), x2(U, v), . . . , Xn (u, v)) 


is also partially differentiable with respect to both variables u and v, and 


9 Әх, 

sa = = fa w d о) 20 ENIM xa 
u 

д2 Xn 

L = fa E £f о) 29 + fag Q0 d 


Example 11.11 Consider function f : R? > R+ with 


2 
z =f (x1,x2,33) = eT, 
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where x1 — uv, x? — и? + v? and хз = и + 2v. We determine the partial derivatives of 
function f with respect to variables u and v. We obtain 


2 2 2 
2x2— __ XT +2x2—-x __ x*--2x2—x 
fa = еї XATA, fa = 2e 2—08. fa =e" 2—53, 
Ox] Ox] 0x2 0x2 0x3 0x3 
ay, =u =m, 22, 2-1, —=2, 
ди до ди до ди до 


and thus for function 2 = z(u, v) 
д 
д 


д 
д 


Z = fy i uU = +» 22 = Qui? + 4u — 1)e PH UH), 
u u u u 


= fay ptt haa pe thin Ge = Quo + Ao — Die a20), 
U U v 0 


Note that the same result is obtained if we first substitute the functions x1 = х (и, v), xo = 
x2(u,v) and хз = x3(u,v) into function f with z = f'(x1,x2,x3) and determine the partial 
derivatives with respect to variables u and v. 


So far we have considered partial derivatives which characterize changes of the function 
value f(x) in a direction of the axis of one variable. In the following, we deal with the 
change of f (x) in a direction given by a certain vector r = (ri,72,..., га). This leads to 
the introduction of the directional derivative given in the following definition. 


Definition 11.7 Let function f : D; > К, Dy © R”, be continuously partially 
differentiable and r = (r1,72,...,74) bea directional vector with |r| = 1. The term 


[grad /(x]T -r = [Vf] -r 
= (0) f Gm. kf (0) rn 


is called the directional derivative of function f at point x° = (S e ru) € Dy in 
the direction r. 


Notice that the directional derivative is a scalar obtained as the scalar product of the vectors 
grad f (x?) and г. The assumption |r| = 1 allows us to interpret the value of the directional 
derivative as the approximate change of the function value if we move one unit from point x? 
in direction г. The directional derivative of function f at point x in the direction г is illustrated 
in Figure 11.6. It gives the slope of the tangent line f (г) to function f at point x? along the 
direction given by vector г. If vector rê = (0,...,0,1,0,...,0)" is the ith unit vector, the 
directional derivative in direction г! corresponds to the ith partial derivative, i.e. 


[grad f(x°)]! - ri = f, (х0). 


400 Functions of several variables 


Figure 11.6 The directional derivative of function f at point x° in the direction r. 


Thus, partial derivatives are special directional derivatives. For the directional derivative of 
function f at point x° in the direction r, we also write 


0 
ar ). 


Example 11.12 The total cost function f: RÌ — R ofa firm producing the quantities х, x2 
and x3 of the three goods 1, 2 and 3 is given by 


f @œ1,x2,x3) = 20 +x? + 2x2 + 3x3 + 2x1x3. 


Let the current production quantities of the three goods be x? = 10,49. — 15 and xQ = 20. 
Assume that the firm has the chance to increase its production by exactly five units, namely 
two products by two units and the remaining product by one unit, and we want to know which 
change results in the smallest increase in cost. This means that we have to investigate the 
directional derivatives in the direction of the vectors 


1 2 2 
r! = Я 2 = | 1 апі т? = 
2 2 1 


First, we consider direction r! = (1,2,2)T, and determine the directional derivative. The 
gradient of function f is given by 


2x1 + 2x3 
grad f (x1,x5,x3) = 2 
3 + 2х1 
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and therefore at point x? = 6,30 2 — (10,15,20) by 


60 
grad f(x°)= | 2 
23 


1 


Since F! is not a unit vector, we have to divide vector F! by its length 


Е = V12 +22 +22 = V9 = 3, 


which yields vector 


We obtain for the directional derivative [grad f(x°)]'(r!) = (af)/(ar!) at point x? in the 
direction r!: 


à 1 4 І 
КАО = [grad f(x")]" —- = z - (60,2,23)- | 2 
r Ir | 2 


1 110 
eg eee есы. 


How can we interpret the directional derivative (9f)/(dr!)? When moving from x one unit 
in the direction of F! (i.e. from x? to x? + г!), the increase in cost is approximately equal 
to 110/3. Equivalently, if we increase the production of good 1 by one unit, and of goods 2 
and 3 by two units each, the approximate increase in the cost is 3 - 110/3 = 110 units. 


For the directional vector F? = (2, 1,2)", we obtain first the unit vector 


E! 
2 Г 

r=—=-] 1 

P 3 
and then the directional derivative at point x? in the direction r?: 
2 

à ro | 

f (х0) = [grad / (хОу. 5. = 1. (60,225. | 1 

ar? r? 3 2 


1 1 
ата ыы л а ода: 


Accordingly, for the directional vector г? = (2,2, 1)T, we obtain first the unit vector 
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and then the directional derivative at point x? = (10, 15,20) in the direction r?: 


za 2 
af о 0\7T г? 1 
а -— = — - (60,2,23)- | 2 


1 1 
5 `(60-2+2.2+23.1) = =: 147 = 49. 


Thus the cost has the smallest increase in the case of extending production of the three goods 
according to vector г! = (1, 2,2)", and the firm would choose this variant. It is worth noting 
that we can here compare immediately the values of the directional derivatives since all 
vectors r!, r?, Г considered have the same length 3. 


11.5 PARTIAL RATE OF CHANGE AND ELASTICITY; 
HOMOGENEOUS FUNCTIONS 


Asinthe case ofa function of one variable, we can define a rate of change and the elasticity of 
a function with respect to exactly one ofthe n variables keeping the remaining ones constant. 
For instance, if price and income may be considered as the two independent variables of 
a demand function, we can distinguish between price and income elasticity of demand. In 
this section, we generalize the rate of change and elasticity of a function of one variable 
introduced in Chapter 4.4 to the case of functions of several variables. 


Definition 11.8 Let function f : Dy > IR, Dy С R”, be partially differentiable and 
х? = (ух Je 1524 € Dr with f (x?) Æ 0. The term 


f, Q9) 
fx) 


fna) = 


is called the partial rate of change of function f with respect to x; at point х0. 


Example 11.13 Let function f : RŽ \ {(0,0)} > R with 
fy =x: pee n 


be given. We determine the partial rates of change of function f at point (20,4) and obtain 


1 
fey) = Vy ele + Put y extra = VELA (1+ >) 


_ _Х_ ауа у ЭМУ xy  2+У руа 
Ay y) е + е x-e : 
2/¥ 4 4/7 
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This yields 
d. 1+% 2+x 
Pf xy) = = = 
Го, у) х 2х 
апа 
РИ), _ Љу) _ 2+У 
POE хуу ду 


For point (20, 4), we obtain 


0f4(20,4) = 0.55 and  pfy(20,4) = 0.375. 


Definition 11.9 Let function f : Dy > IR, Dy С R”, be partially differentiable and 
x? = (xs me me) € Dy with f (x?) Æ 0. The term 


Xi ( v) 
£r, (0) = 0 pra Q0) = х0 AS 


is called the partial elasticity of function f with respect to х; at point x°. 


For point x? є Dy, the value ғу, (х0) is approximately equal to the percentage change in the 
function value caused by an increase of one per cent in the variable x; (i.e. from x to 1.0 1х0) 
while holding the other variables constant. 


One important class of functions in economics is the class of so-called homogeneous 
functions, which we introduce next. 


Definition 11.10 А function f : Dy > R, Dy € R”, is said to be homogeneous of 
degree k on Dy if > 0 and (x1,x2,..., x4) € Dy imply (tx1, tx2, . .., Xn) € Dy and 


JCAL oc An) = A -f (X1,X2,...,Xn)  forallt > 0, 


where k can be positive, zero or negative. 


According to the latter definition, a function is said to be homogeneous of degree k if the 
multiplication of each of its independent variables by a constant t changes the function value 
by the proportion ¢*. The value t can take any positive value provided that (tx, tx», . . . , Xn) 
belongs to the domain Dy of function f. We mention that many functions are not homogeneous 
of a certain degree. The notion of a homogeneous function is related to the notion 'returns to 
scale' in economics. It is said that for any production function, there are constant returns to 
scale, if a proportional increase in all independent variables x1,x2?, . . ., x, leads to the same 
proportional increase in the function value (output) f (x1, x2, .. . , x4). Thus, constant returns 
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to scale would mean that we have a homogeneous function of degree k = 1. If there is a 
larger increase in the function value than in the independent variables (i.e. taking ¢ times 
the original values of the input variables, the production output increases by a bigger factor 
than f), it is said that there are increasing returns to scale. This corresponds to a homogeneous 
function of degree k > 1. Finally, if there is a production function with decreasing returns 
to scale, then it corresponds to a homogeneous function of degree k < 1. We continue with 
two properties of homogeneous functions. 


THEOREM 11.4 (Euler's theorem) Let function f : Dy — IR, Dy С К", be continuously 
partially differentiable, where inequality ¢ > 0 and inclusion x = (x1,x2, ..., x4) € Dy imply 
inclusion (tx1, 1x2, . . ., xs) € Dy. Then function f is homogeneous of degree k on Dy if and 
only if the equation 


Xp fe (Ж) + хо fe, Q0) +...+ x» 5, (0) = Ё-/(х) (11.1) 
holds for all points x = (x1, x2,...,xn) € Dy. 
In gradient notation, we can express equation (11.1) in Euler's theorem as 

x! . grad f(x) = k - f (x). 
Applying the definition of partial elasticity, we obtain the following corollary. 
COROLLARY 11.1 Let function f : Dy > R, Dy С R”, with f(x) Æ 0 for x € Dy, be 
homogeneous of degree k. Then the sum of the partial elasticities of function f is equal to К, 
i.e. 


£f (X) + ex, (X) +... ep, (X) = К. 


Example 11.14 Consider a Cobb-Douglas production function Q : RŽ — R with two 
variables K and L given by 


ОК) =4: K^ . 19, 


where K denotes the quantity of capital and L denotes the quantity of labour used for the 
production of a firm. Determining the partial derivatives, we get 


Ox(K,L) = А: ај: Ка! . L” апі Qr(K,L) = А. Ка . az- L! 


The partial derivative Ок is also known as the marginal product of capital and Qr as the 
marginal product of labour. 
For the partial elasticities at point (К, Г), we obtain 


A-a,-K%-1L2 


€x, D —K.— аграр 


=a, and 
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A- K” . a. L2! 
ef L(K,L) = 1. J Ka. Па = a2. 


Thus, the exponents in the Cobb-Douglas function correspond to the partial elasticities. 
Finally, from the equalities 


QUK, tL) = A- (К) - GL)? = + . OCK, D), 


we find that function Q is homogeneous of degree а] + a2. 


11.6 IMPLICIT FUNCTIONS 


In our previous considerations, the function was defined explicitly, i.e. we considered a func- 


tion of one or several variables in the form: y = f(x) and z = f (x1, x2, . . . , Xn), respectively. 
However, in many applications a relationship between two variables x and y may be given in 
the form: F(x,y) = 0, or correspondingly, Е(х1,х2,...,хп;2) = 0. In the former case, 


we say that y is implicitly defined as a function of x if there exists a function f with y = f (x) 
such that F (x, y) = F(x,f(x)) = 0. 


Example 11.15 Let 


2 2 


xt у 
ate lao. 


The above equation defines two elementary functions of the form 


2 
X 
yi2 +61- 
a 


and 


— 1 x? 
y2 rx ai 


Each ofthe above functions defines half an ellipse (see Figure 11.7) with the half-axes a and b 
and the centre at point (0, 0). Inserting the above explicit representations into Е (x,y) = 0, 
we obtain in both cases the identity 
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Figure 11.7 The ellipse x? /a? + 2/5? — 1 = 0. 


However, not every implicit representation F (x, y) — 0 can be solved for y, e.g. the equation 
2у6 + y — lnx = 0 cannot be solved for y. 


Similarly, one can consider an explicitly given system in the form: 


yı > Ai @1,%2,- ss Xn) 
y2 Im Р(1,2х2,. we Xn) 


(11.2) 
Ут = Sm(X1,%2,---,Xn)- 
Such a system of equations might be given in implicit form: 
Fy (1, x2, ie Xni JbJ2; 255 Ут) = 0 
Fy (x1, x2, ** Хр У1, 2, ©» Ут) = 0 
| (11.3) 


Е, (х1,22,. $us ‚Хп; У1,у2, . a Ym) = 0. 


Representation (11.2) is referred to as the reduced form of system (11.3). The next problem 
we deal with is to find an answer to the following question: under what conditions is it 
possible to put a system given in the form (11.3) into its reduced form (11.2)? This question 
is answered by the following theorem. 


THEOREM 11.5 Let a system (11.3) of implicitly defined functions with continuous partial 
derivatives with respect to all variables be given and (х0; у?) = (a2, x) ERE х0; ve 35; M y9) 
be a point that satisfies system (11.3). If the matrix of the partial derivatives of the functions 
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F; with respect to the variables y; is regular at point (x9; y?, ie. 


#0 (|J(x9; y9)| is the Jacobian determinant), 


дЕ; 0. ,0 
гео; y^ = det ( 227027) 
дук 


then system (11.3) can be uniquely put into its reduced form (11.2) in a neighbourhood of 
point (x9; уд). 


We illustrate the use of Theorem 11.5 by the following example. 


Example 11.16 Consider the implicitly given system 
FiGQa,xaiyiy2) = хуу — x2 — y2 = 0 (11.4) 
Fo(x1,%25 91,92) = жуз — xi t y1 = 0, (11.5) 


and we investigate whether this system can be put into its reduced form. Checking the 
Jacobian determinant, we obtain 


oF, OF] 

ду\ дуз X1 —1 2 
J(x1,X2;y1, = = = 1. 
|J 01,2: y1. Y2)] ӘР) ӘР» 1 x xix; + 

дур ду» 


According to Theorem 11.5, we can conclude that ће given system can be put into its reduced 
form for all points (x, y) = (x1,x2; y1, y2) with xix? + 1 0 satisfying equations (11.4) and 
(11.5). Indeed, from equation (11.4) we obtain 

y2 = ху — X2. 
Substituting the latter term into equation (11.5), we get 

xi — x2) —x1 yi =0 
which can be rewritten as 

(х1х5 + Dy = х1 +23. 


For xix? + 1 0, we get 


xı + 3) 
1 dem MT 
4 xps +1 


and 


xı + 3 x + x1x3 = xix; — X2 x — X2 
y2 — . X2 — 


xix? +1 xix? +1 B х\хў Je 
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If in the representation F (x1, x2,...,Xn3Z) = 0 variable z can be eliminated, we can perform 
the partial differentiation with respect to some variable as discussed before. However, if it 
is not possible to solve this equation for z, we can nevertheless perform the partial differ- 
entiations by applying the following theorem about the differentiation of an implicitly given 
function. 


THEOREM 11.6 (implicit-function theorem) Let function F : Dr — R, Dr С "+1, 
with F(x;z) = F(x1,x2,...,Xn3Z) be continuous and F(x°,z°) = 0. Moreover, let function F 
be continuously partially differentiable with F,(x°,z°) 5 0. Then in some neighbourhood 
U. (x?) of point x with є > 0, there exists a function f with z = f (x1,x2,.. ., x4), and 
function f is continuously partially differentiable for x € О, (х0) with 


Fy; (x; 2) 
Е,(х;2)` 


f; (X) = fu (1,22, . А . Xn) = — 


For the special case of n = 2, the above implicit-function theorem сап be formulated as 
follows. Let F (x,y) = 0 be an implicitly given function and (х0, y?) be a point that satisfies 
the latter equation. Then, under the assumptions of Theorem 11.6, for points x of some 
interval (a, b) containing point хо the first derivative of function f with y = f(x) is given by 


Fy, у) 


ше 


(11.6) 


Example 11.17 We determine the derivative of function f with у = f(x) implicitly 
defined by 


Е (x,y) = &? — 24xy’ + 16y? = 0 
by means of the implicit-function theorem. This yields 
Е,(х,у) = 24x? —24y?, Р(х,у) = —48xy + 48y?, 
and we obtain 


у A ху 
—48y(x — y) 2y 
As an alternative, we can also directly differentiate function F(x,y(x)) — 0 with respect 


to x. In this case, using the product rule for the second term of F (x, y) (since y depends on x), 
we get 


24x? — 24y? — 48xyy’ + 48у?у = 0, 
from which we obtain 

48y'y(y — x) = 24 — x?) 
and consequently 


‚_ XHY 


2у 
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The latter formula allows us to determine the value of the first derivative at some point 
(x, y) with F(x, y) = 0 without knowing the corresponding explicit representation y = f (x). 
This is particularly favourable when it is impossible or difficult to transform the implicit 
representation F (x, y) = 0 into an explicit representation y = f(x) of function f. 


Using formula (11.6), we can determine the second derivative y" of function f. By applying 
the chain rule, we get 


Й [Fco y) + Fry (x,y) “yx | Fy (x,y) — [Fyx G5, y) + Fy, у) x] Р(х,у) 
> Fly) 


with y, = у = f'. Substituting уу = y' according to formula (11.6), we obtain 


Fry) F26,y) — 2+ Fg (y) Ебу) Бубу) + By sy) Ё?(х,у) 
F3, y) | 


(11.7) 


Example 11.18 Let function f : Dy — R be implicitly given by 
F(x,y) 2x — y t 2siny = 0. 

(Note that we cannot solve the above equation for y.) We determine y' and y". Using 
Fx(x,y)2l and Fy(x,y) = —1+2cosy, 


we get 


1 1 
—1+2cosy 1-—2cosy 


/ 


y = 


Moreover, using 
Р(х,у) = Fox, у) = 0 and Fyy(x, y) = —2 siny, 


we obtain by means of formula (11.7) 


p 0-(—1+2cosy)* —2-0-0-(—1+2cosy) + (—2siny- 1) 
P (—1 + 2cos y? 
p 2siny 
~ (2cosy — D?" 


11.7 UNCONSTRAINED OPTIMIZATION 


11.7.1 Optimality conditions 


In economics, one often has to look for a minimum or a maximum of a function depending 
on several variables. If no additional constraints have to be considered, it is an unconstrained 
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optimization problem. As an introductory example we consider the maximization of the profit 
of a firm. 


Example 11.19 For simplicity assume that a firm produces only two goods X and Y. 
Suppose that this firm can sell each piece of product X for 45 EUR and each piece of 
product Y for 55 EUR. Denoting by x the quantity of product X and by y the quantity of 
product Y, we get revenue R = R(x, y) with 


R(x, y) = 45x + 55y, Dg = RÀ. 


The production cost of the firm also depends on the quantities x and y. Suppose that this cost 
function C — C(x, y) is known as 


C(x, y) = 300 + x° + 1.5? — 25x – 35y, | Dc = RÀ. 


One can easily show that inequality C(x, y) > 0 holds for all non-negative values of variables 
x and y, and for the fixed cost of production one has C(0, 0) = 300. In order to determine the 
profit P = f(x,y) in dependence on both values x and y, we have to subtract the cost from 
revenue, i.e. 


fo» zi R(x,y) ES C(x,y) 
= 45x + 55y — (300 + xà + 1.5? — 25x — 35y) 
= —x? + 70x — 1.5? + 90у — 300. 


To maximize the profit, we have to determine the point (x, y) € R2 that maximizes function f . 
We say that we look for an extreme point (in this case for a maximum point) of function f. 
This is an unconstrained optimization problem since no other constraints of the production 
have to be taken into consideration (except the usual non-negativity constraints of variables 
x and y). 


In the following, we look for necessary and sufficient conditions for the existence of extreme 
points of a function depending on n variables. First, we formally define a local extreme (i.e. 
minimum or maximum) point as well as a global extreme point. 


Definition 11.11 A functionf : Dy — IR, Dy С R”, has a local maximum (minimum) 
at point x? if there exists an €-neighbourhood О, (х?) є Dy of point x? with e > 0 
such that 


f(x) <f(x°) FŒ = f (x9), respectively) (11.8) 


for all points x є 0, (х0). Point x? is called a local maximum (minimum) point of 
function f. 


If inequality (11.8) holds for all points x € Dy, function f has at point х0 a global 
maximum (minimum), and x is called a global maximum (minimum) point of function f . 
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THEOREM 11.7 (necessary first-order conditions) Let function f : Dy > R,Dy С R”, 
be partially differentiable. If x? = ias "e ,x9) € Dy is a local extreme point (i.e. a local 
maximum or minimum point) of function f , then 


grad f(x°) = grad /(х0,х0,...,х0) = 0, 


fy (x) = 0, 5,000) = 0, ..., (0) = 0. 


A point x? with grad f (x?) = 0 is called a stationary point. According to Theorem 11.7 we 
can conclude that, if х? = (x? x9, eie x? ) is a local extreme point (assuming that function f 
is partially differentiable at this point), then it has to be a stationary point. 


In order to determine a global minimum or maximum point of a function f depending on 
n variables, we have to search among the stationary points (which can be found by means 
of differential calculus), the points on the boundary of the domain and possibly among the 
points where the partial derivatives do not exist. This is similar to the case of a function of 
one variable (see Chapter 4). In the following, we focus on the determination of local extreme 
points by differential calculus. 


Next, we are looking for a sufficient condition for the existence of a local extreme point. In 
the case of a function of one variable, we have presented one criterion which uses higher- 
order derivatives. In the following, we present a criterion that uses the second-order partial 
derivatives to answer the question of whether a given stationary point is indeed a local extreme 
point. We begin with the definition of the Hessian. 


Definition 11.12 The matrix 


thax (^) Jis (Y) o Jis. =) 
foa) fex OP) © fox (X) 
Hy (x9) = (О) 22: 2 d 2 А 2. | 


Sanxi (х0) y (x9) en xs (х0) 


is called the Hessian, or the Hessian matrix, of function f at point x? = 


0 „0 
Оев © КО, 


Under the assumptions of Theorem 11.1 (i.e. continuous second-order partial derivatives), 
the Hessian is a symmetric matrix. The following theorems present sufficient conditions for 
so-called isolated (local or global) extreme points, for which in Definition 11.11, the strict 
inequality holds in inequality (11.8) for x Æ x?. 


THEOREM 11.8 (local and global sufficient second-order conditions) Let function f : 
Dy — R,Dy C R”, be twice continuously partially differentiable and let x? = 
aD Ре ,x9) € Dy be a stationary point of function f. Then: 


(1) If the Hessian Hy (x9) is negative (positive) definite, then x? is a local maximum 
(minimum) point of function f. 
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(2) If Hy (x?) is indefinite, then function f does not have a local extremum at point х0. 
(3) If Hy (x) is negative (positive) definite for all points x € Dy, then x? is a global maximum 
(minimum) point of function f over Dy. 


A stationary point x° of f (x) that is neither a local maximum nor a local minimum point is 
called a saddle point of function f . Next, we give a sufficient condition for a stationary point 
to be either a local minimum, a local maximum or a saddle point in the case of a function of 
two variables. 


THEOREM 11.9 (special case of Theorem 11.8 for the case n = 2) Let function f: 
Df > R, Df £ R?, be twice continuously partially differentiable and let (x9, y9) € Dr bea 
stationary point of function f. Then: 


(1) If 
2 
Ноо, уб) = 00,79) fry") — [fe] > 0, 


then (х, y?) is a local extreme point of function f , in particular: 


(a) I£ f (x9, y?) < 0 or Sy (x9, у0) < 0, it is a local maximum point. 
(b) If f (x9, y?) > 0 or Sy (x9, y°) > 0, it is a local minimum point. 


(2) If Ну (x9, y) < 0, then (х0, y?) is a saddle point of function f. 
(3) If [Ну (x, y9)| = 0, then (х0, y?) could be a local maximum point, a local minimum 
point, or a saddle point of function f. 


In case (1) of Theorem 11.9, we can take any of the second-order partial derivatives fex or fyy 
in order to decide whether a stationary point is a local maximum or minimum point. Notice 
also that in the case of a function f depending on two variables, at a saddle point function f 
has a local maximum with respect to one of the independent variables and a local minimum 
with respect to the other independent variable. 


Now we are able to find the extreme point of the profit function f with z = f(x,y) given in 
Example 11.1. 


Example 11.1 (continued) We first establish the necessary first-order conditions according 
to Theorem 11.7 and obtain 


f(x,y) = —2x +70 = 0 
Љб,у) = —3y + 90 = 0. 
Thus, we get the only stationary point P, : (x1, y1) = (35, 30). In order to check the sufficient 


second-order conditions according to Theorem 11.9, we determine the second-order partial 
derivatives 


Љу) = —2, fy@y=-3, Љу) = /ух(х,у) = 0, 
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set up the Hessian 


[Љо fy @y) Jet -2 0 ) 
Hr = (Fer? Sy@y) J 0 -3J" 


and obtain 
[Hy (35, 30)| = f. (35, 30) - fy, (35, 30) — [fe (35, 30) ]” = (—2).(—3)— 0-620. 


Therefore, point Р! is a local extreme point. It is worth noting that in this case the value of 
the determinant of the Hessian does not even depend on the concrete values of variables x 
and y. Moreover, since 


fx (35, 30) = —2 < 0 (or analagously since f,(35, 30) = —3 < 0) 


point P, with (x1, y1) = (35, 30) isa local maximum point of the profit function. The maximal 
profit is equal to P = f (35,30) = 2,275 units. 


Example 11.20 Let function f : R? — R with 
1 1 1 
f@y) = rid + at +x? — 4х — e 


be given. We determine all local extreme points of function f. The necessary first-order 
conditions according to Theorem 11.7 for an extreme point are 


foy) = 2 +xy+2x-4=0 (11.9) 
1 1 
foy = = — гу? = 0. (11.10) 
2 2 
From equation (11.10) we obtain x? = y? which yields |x| = |y| or equivalently y = +x. 


Case 1 у = x. Substituting y = x into equation (11.9), we get 2x? + 2x — 4 = 0, or 
accordingly, 


x!-x—-220. 


This quadratic equation has the two solutions x} = —2 and x? = 1. Thus, we get y; = —2 
and y? — 1, which gives the two stationary points 


Pi: (х,у) = (72, 22) and Рә: (x2, y2) = (1, D. 


Case 2 у = —x. In this case, we obtain from condition (11.9) the equation 2x — 4 = 0 
which yields x3 = 2 and y3 = —2 and the stationary point 


P3 : (Хз, уз) = (2, —2). 


Thus, we have three stationary points, Pı : (—2, —2), P2 : (1, 1) and P3 : (2, —2). Next, we 
check the sufficient second-order conditions according to Theorem 11.9 and obtain 


fx y) = 2x +у+ 2, Sy GG y) =) fy Gy) = /ух(%, у) =x. 
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This yields the Hessian 


e ооз e LEM x ) 
Ну (x, y) md ( Six Qc y) fy y) ка x —y > 


and for the stationary points, we obtain 
IHjC-2, -2)| = f (72, 22 572, -2) - [f C-2, -20]” 
= (—4) -2 — (—2}? = —12 < 0, 
IH, DI = fall, D 50, D — [fo D = 5-(—1) 1? = —6 < 0, 
ІН; (2, —2)| = fa (2, 72) - Ay (2, —2) – [Жу 2, -[ =4.2-27?=450. 


Thus, according to Theorem 11.9, P, and P» are saddle points, and only point Рз is a local 
extreme point of function f . Due to f; (2, —2) = 4 > 0 (or accordingly fj, (2, —2) = 2 > 0), 
point P3 is a local minimum point with the function value f (x3, уз) = —4. 


We continue with another example of finding the maximum profit of a firm. 


Example 11.21 We consider the profit maximization problem of a firm that uses two input 
factors F; and F; to produce one output. Let P be the constant price per unit of the output, x be 
the quantity of factor F1, y be the quantity of factor Fz as well as pı and p2 be the prices per 
unit ofthe two inputs. Moreover, function f : R2 — Ris the production function of the firm, 
i.e. f (x, y) gives the quantity produced in dependence on quantities x and y. Thus, Pf (x, у) 
denotes revenue from which cost of production, i.e. рух + p2y, has to be subtracted to get 
the profit function g : R? — R. This problem can be written as 


g(x,y) = P -f (х,у) — pix — poy > max! 


The necessary first-order conditions (according to Theorem 11.7) for the optimal choices of 
the input factors А and F are as follows: 


gx y) = Py) -pi-0 
gy x,y) = PfyGx, y) — р) = 0. 


Let us suppose that the production function f is of the Cobb-Douglas type given by 
fü =x yh. 
Then the two necessary first-order optimality conditions according to Theorem 11.7 become 


1 E 
gy) = 5 Pox 2.1/3 р =0 


gay) = = Px 
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Multiplying the first equation by x and the second one by y, we get 


1/2. 


У 


-x yl? — pix 20 


КОЕ s = ру 


Moreover, multiplying the first equation by —2/3 and summing up both equations gives 


2 

5 Pix – ру = 0 
3 

and therefore 


2pi 
к=  X 
3p2 


Substituting this into the above equation g, (x, y) = 0, we get 


1 —1/2 {2Р1 i 
L.P. x2. | Z. —p =0. 
2 ? E i A 


The latter can be rewritten as the equation 


1 2 1/3 
; | P «xC 1241/3) , (2) = р 
P2 


from which we obtain 


6 
P (21). 
X — к= # IT. 
2p, \3pe2 


and thus 


(2): (Ж) - m) (ы) 
XQ = . = = . А 
e = Mn 3m) 0р1): Gp \2р! 3p) 


Furthermore, we get 


m$) (у) (ж) (ж) 
yo = “xo = 3 * = x 2 
3p; 3p; \2pı 3р2 2pi 3р2 


Thus, we have obtained the only stationary point 


wC ETETE) 


for the quantities of both input factors F; and Р». 
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Checking the sufficient second-order conditions according to Theorem 11.9, we get 
l -3/2 „1/3 2 р 125/3 
Аа т DA , gp egre oy , 


1 ET cad 
$505) = Gye (ty) = = Px 1/2 , y—2/3 


and consequently, 


1 1 
lo .powap d Dog ee Кс: 
Ec ву» |_ | а E NEM (C Mr. 
Hr y)l = духу) фуу) | | 1 2 5/3 
yx (X, yy X, PU RM г. Су. 
6 9 
2 1 1 
= 2 .pl.xycl.y745 2 .pl.xycl.y740 = 07 .pl.y-1.,74/ 
36 MESE 36 ж 36 ах: 


Since хо and yo are positive, we have obtained inequality |Hy (хо, yo)| > 0 which means that 
(хо, yo) is a local extreme point. Since gxx(xo, yo) < 0, (хо, уо) is a local maximum point 
according to Theorem 11.9, and the corresponding supply is given by 


i 271/2 3 31/3 
_ a2 WB 28 (2 А =) (2) 
Р (Xo, y0) = хо `Yo -|() (s) | [Ё 3р2 
Р \? (Р \? 
z x: 


Example 11.22 Each oftwo enterprises E, and E» offers a single product. The relationships 
between the outputs хү, x2 and the prices pı, p» are as follows: 


xı = 110 — 2p, — p2, 
х = 140 — pi — Зр». 


The total costs for the enterprises are given by 


Ci (x1) — 120 + 2x1, 
C2 (x2) = 140 + 2х2. 
(a) First, we determine the profit functions fi, f2 of both enterprises and the total profit function 


f = ЛА fo. We recall that the profit function fj, i € {1,2}, is obtained as the difference of 
the revenue х;р; and the cost C; = C;(xj), i.e. 


Ai@1,p2) = xipi — Ci (x1) 
= (110 — 2p; — pz): pı — [120 + 2(110 — 2p; — р2)] 
= 110p, — 2p? — pip2 — 120 — 220 + 4p; + 2p2 


= —340 — 2p? — pip2 + 114p1 + 2р2, 
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epi. p2) = хор — C362) 
= (140 — р — 3р2) : p2 — [140 + 2(140 — р — 3p;)] 
= 140p; — pipa — 3р5 — 140 — 280 + 2p1 + 6p; 


= —420 — pipa — 3р5 + 2p1 + 146p2. 
This yields the total profit function f = f, + f as follows: 


f (pi, p2) = —760 — 2p? — 2pip2 — 3р5 + 1M6pi + 148p». 


(b) Next, we determine prices p; and p» such that the total profit is maximized. Applying 
Theorem 11.7, we get 


fpi (D, p2) = —4pi — 2р2 + 116 = 0 
fp, (pi, pa) = —2pi — 6p2 + 148 = 0. 


The latter system can be rewritten as 


2pi + p2 = 58 
pi + 3р2 = 74, 


which has the unique solution 

py = 20 and p = 18. 
Checking the sufficient conditions according to Theorem 11.9, we obtain 

Н, (20, 18)| = fp, (20, 18) - fp, pa (20, 18) — [foi (20, 18] 

= (—4) - (-6) — (-2 = 20 > 0. 

Therefore, the stationary point obtained is a local extreme point. From 

Jpı pı (20, 18) = —4 < 0, 
we have found that (po, р?) = (20, 18) is a local maximum point with the total profit 

f (20, 18) = —760 — 2. (20? — 2.20.18 — 3 - (18)? + 116.20 + 148 - 18 = 1,732. 
(c) Following some problems between the enterprises, E» fixes the price р» = 22. Next, we 
want to investigate which price p? would ensure a maximum profit fj for enterprise Еј under 
this additional condition. In this case, we get a function / with 

fi@i,22) = —340 — 2p? — 22pi + 114p; + 44 = —296 — 2p? + 92pi 


which depends now only on variable pı. We denote F (p1) = fi (p1, 22). Applying the criterion 
given in Chapter 4.5 (see Theorem 4.12), we get as a necessary condition 


F'(p) = —4p1 + 92 = 0 
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which yields the only solution 
рі = 23. 

Since 
F" (pï) = —4 < 0, 


the solution obtained is indeed the maximum profit of enterprise E; under the above constraint 
р» = 22. From equality f (23,22) = 1,642, the total profit of both enterprises reduces by 
1,732 — 1,642 = 90 units in this case. 


Example 11.23 The profit function P : RÀ — R for some agricultural product is a function 
of the quantities used of three sorts of chemical fertilizers. Let function P with 


1 


P(Ci, C5, Сз) = 340 — 3 


1 1 
Ci + „Сї + 2С\С› - С? + QC; — 563 +2C1 + Сз + Сз 
be given, where С; denotes the quantity of the ith chemical fertilizer used in tons, i є {1, 2, 3}, 
and Р(С, C2, Сз) is the resulting profit in thousands of EUR. We determine the quantities 
of the three sorts of chemical fertilizers such that the profit is maximized. Checking the 
necessary conditions of Theorem 11.7, we get 


Po, = -С +С 72€ 4-220 (11.11) 
Po, = 2C; — 2С› 4-C4 +1= 0 (11.12) 
Ро = С Сз +1 = 0. (11.13) 


Adding equations (11.12) and (11.13), we obtain 
Cy = 2C, +2 = 2(Ci + 1). (11.14) 
Inserting equation (11.14) into equation (11.11), we get 
—C?+C,+4C,+44+2=0 
which can be rewritten as 
С} — 5С, — 6 = 0. 
The latter quadratic equation has two solutions denoted by С 9 and Co: 


CG? =6 аа CY=-1. 


The second root cy = —1 does not lead to a point belonging to the domain of function P. 
Using cr = 6, we get from equations (11.14) and (11.13) 


cP 22060 +)=4 and CP acM41=15. 


So, we have only one stationary point (6, 14, 15) within the domain of function P. Next, 
we check whether this point is indeed a local maximum point. We get the Hessian 
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Hp(Cj, C2, Сз) of function P as follows: 


—2Cı +1 2 0 
Hp(C1, С, Сз) = 2 —2 1 
0 1 -1 


Thus, for point (C, СЇР, СЇ) = (6, 14, 15), we obtain 


c 2 0 
Hp(6, 14, 15) = Devt X 
0 1 -l 


Checking the sufficient conditions for a local maximum point, we determine the leading prin- 
cipal minors Dı, D; and Р; of matrix Hp (6, 14, 15) and test the conditions of Theorem 11.8 
using Theorem 10.7: 


zii 9 
Dı = —11 < 0, D= 3 3 Eas, 
=. 2 0 
D; = 2 -2 1|=-7<0. 
0 1 -1 


Thus, from Theorem 10.7 the Hessian Hp (C1, C2, Сз) is negative definite for point (6, 14, 15), 
and this point is indeed the only local maximum point found by differential calculus. Since 
function P has no larger objective function values on the boundary of the domain and since 
function P has continuous partial derivatives, the obtained solution is also the global maxi- 
mum point with function value P(6, 14, 15) — 396.5, which means that the resulting profit 
is equal to 396, 500 EUR. 


1.7.2 Method of least squares 


We consider the following application of the determination of an extreme point of a function 
of several variables. Assume that n points (xj, yj), i = 1,2,...,n, are given. The aim is 
to determine a function f of a specified class (for instance a linear function or a quadratic 
function) such that f (x) describes the relationship between x and y ‘as well as possible’ for 
this class of functions. 


We apply the criterion of minimizing the sum of the squared differences between y; and f (x) 
(denoted as the Gaussian method of least squares), i.e. 


Q-— (fo) – y) > min! 
i=l 


It is assumed that we look for an approximation by a linear function f with f (x) = ax + b 
(see Figure 11.8). 


The objective formulated above leads to the following optimization problem with function 
Q = Q(a, b) depending on two arguments a and b: 


Q(a,b) = X (ax; + b — yi)? > min! 


i=1 
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тр T T3 14 [ERE A т 
Figure 11.8 Approximation by a linear funtion f with f(x) = ax + Ь. 


As we know from Theorem 11.7, the necessary conditions for an extreme point are that the 
first-order partial derivatives with respect to the variables a and b are equal to zero: 


Ог (a, b) = 0 and Оь(а, b) = 0. 


Determining the first-order partial derivatives of function Q, we get: 


Ол(а, Б) 22 M xi (ai b- yi) = 0 
i=1 


Op(a, b) = 2 Ў (ах +Ь—у) = 0. 


i=1 


This is a system of two linear equations with two variables a and b which can be solved by 
applying Gaussian elimination or the pivoting procedure (we leave this calculation to the 
reader). As a result, we obtain the values of parameters a and b as follows: 


O9 (>) E) (=) 


п п 2 
2 
nox — (oxi 
i=1 i=1 


; (11.15) 
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The latter equality can be rewritten in terms of parameter a calculated according to 
equation (11.15) as follows: 


n n 
Ун «У 
і=1 i=1 


b=! жае (11.16) 


п 


One can derive that the denominator of value a is equal to zero if and only if all x; values аге 
equal. Thus, we get a unique candidate for a local extreme point of function Q provided that 
not all x; are equal. Applying Theorem 11.9, we get that a and b obtained above satisfy the 
sufficient conditions for a local minimum point and we have determined the ‘best’ straight 
line ‘through’ this set of п points according to the criterion of minimizing the sum of the 
squared differences between y; and the function values f (x;) of the determined straight line. 


Example 11.24 The steel production of Nowhereland during the period 1996—2001 
(in millions of tons) was as shown in Table 11.1: 


Table 11.1 Data for Example 11.24 


Year 1995 1996 1997 1998 1999 2000 2001 


Production 450.2 458.6 471.4 480.9 492.6 501.0 512.2 


Drawing the relationship between the year and the steel production in а coordinate system, 
we see that it is justified to describe it by a linear function, and therefore we set 


f(x) =ax+b. 
First, we assign a value x; to each year as follows and compute the required values. 


Table 11.2 Application of Gaussian method of least squares 


Year i Xj Yi х? х;у; 

1995 1 —3 450.2 9 —1,350.6 
1996 2 —2 458.6 4 — 917.2 
1997 3 —1 4714 1 — 4714 
1998 4 0 480.9 0 0 
1999 5 1 492.6 1 492.6 
2000 6 2 501.0 4 1,002.0 
2001 7 3 512.2 9 1,536.6 
Sum 0 


3,366.9 28 292.0 


In Table 11.2, we have taken the year 1998 as origin x4 = 0 (of course, this can be arbitrarily 
chosen). With our choice, the x values are integer values ranging from —3 to 3, i.e. they are 
symmetric around zero. Such a choice is recommended for cases when n is odd and distances 
between successive x values are equal since it simplifies that calculations. 


422 Functions of several variables 


Using n = 7, we can immediately apply formulas (11.15) and (11.16), and we obtain 


n 


n n 
= ES = ‚ 292.0 — 0 - 3366. 
E =. = MN OEE LES 10.429, 


п "EE 7.28— 02 
n- Ух — (5:5) 
ї=1 i=1 
n n 
у н =a: 3-3) 
p= =! i=1 _ 3366.9 — 10.429 -0 ~ 480.986. 


n Е 7 
The above values for parameters а and b have been rounded to three decimal places. Thus, 
we have found function f with 


f (x) ~ 10.429x + 480.986 


describing steel production between 1995 and 2001 (ie. for the years x є 
(—3, —2, —1,0,1,2, 3}). Using this trend function, we obtain for the expected steel production 
for the year 2006 (i.e. for x — 8): 


f (8) © 10.429 - 8 + 480.986 = 564.418 


millions of tons. 


Example 11.25 Assume that the following ordered pairs have been found experimentally: 
(x1, y1) = (1,3), (х2,у2) = (2, 8), (x3, y3) = (1,0) and (x4, y4) = (—2, 1). 


Further, we assume that the relationship between variables x апа у should be expressed by a 
function f of the class 


f(x) =ax+ bx”. 


In this case, we cannot use formulas (11.15) and (11.16) for calculating the parameters a and b, 
but we have to derive formulas for determining the parameters of the required function. This 
may be done in a quite similar way as for the approximation by a linear function. According 
to the Gaussian method of least squares we get the problem 


n 


Оа = Y (axi + bx? у) — min! 


i=1 


Using n = 4, we obtain the first-order necessary conditions for a local extreme point: 


Qq(a,b) =2 555 (ax; + bx? - 5) 20 
i=1 


4 
On(a,b) =2 Y x} (ax; + bx? — yi) = 0. 


i=1 
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We can rewrite the above conditions as follows: 


4 4 4 
Vix =a xp +b x (11.17) 
i=1 i=1 i=1 
4 4 4 
X xia pth) xf. (11.18) 
i=1 i=1 i=1 


From equation (11.17) we obtain 10a = 17 and thus a = 1.7, and from equation (11.18) we 
obtain 34b = 39 and thus b = 39/34 ~ 1.147. Hence the best function f of the chosen class 
according to Gauss's method of least squares is approximately given by 


Ро) © 17x + 144727. 


11.7.3 Extreme points of implicit functions 
Next, we give a criterion for the determination of local extreme points in the case of an 


implicitly defined function F(x, y) = 0. 


THEOREM 11.10 Let function f : Dy — К, ру C RR, be implicitly given by F(x, y) =0 
and let function F be twice continuously partially differentiable. If the four conditions 
(1) FG5,59) = 0, (2) F.G5,59) = 0, (3) Fy(x°,y9) 0 and (4) Fa(®, y?) Æ 0 are 
satisfied, then (х0, у) is a local extreme point of function f , in particular: 


(1) If 


Bey) 
—— > 0, 
Fy (x°, y°) 


then (x°, y?) is a local maximum point. 
(2) If 


Fax, y?) 
——— uar ет , 
Fy (х0, y°) 


then (х0, y?) is a local minimum point. 
y р 


Example 11.26 Given 
F(x,y) = —х? + 2y? —8=0, 


we determine local extreme points by applying Theorem 11.10. To find the stationary points, 
we check 


Р(х,у) = —2х = 0, 


from which we obtain the only solution x; = 0. Substituting this into F (x, y) = 0, we get the 
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two solutions y; = 2 and y2 = —2, i.e. there are two stationary points 
Ру: (х,у) = (0,2) and P2 : (x1,y2) = (0, –2). 
Furthermore we get 
Fy(x,y) = 4y and Fy (x,y) = —2. 
For point P, this yields 


Fyx(0,2)  —2 


жуэ е су 
Е,(0,2) 8 


i.e. point (0,2) is a local minimum point. For point P5, we get 


F&(0,-2) -2 


= — >], 
Fy (0, —2) —8 


i.e. point (0, —2) is a local maximum point. It should be noted that F(x,y) = 0 is not a 
function but only a relation here, nevertheless we can apply Theorem 11.10 for finding local 
extreme points. 


11.8 CONSTRAINED OPTIMIZATION 


So far, we have dealt with optimization problems without additional constraints. However, 
in economics, one often looks for an optimum of a function subject to certain constraints. 
Such problems are known as constrained optimization problems. We have already considered 
problems where the output ofa firm depends on the quantity ofthe used capital K and the used 
quantity of labour L. A natural question in economics is the maximization ofthe output subject 
to certain budget constraints. This means that the quantity of capital and labour which can 
be used in production is bounded by some given constants. In the case of a linear objective 
function and linear inequality constraints, we have already considered such problems in 
Chapter 9. Here, we consider the minimization (or maximization) of a function subject to 
some equality constraints. Such a problem may arise if the output Q(K, L) has to be equal to 
a given isoquant Q*. The production cost depends on the quantity of used capital and labour. 
In that case, the objective is the minimization of a cost function f. depending on variables K 
and L subject to the constraint O(K,L) = Q*. In this section, we consider the general case 
of constrained optimization problems with non-linear objective and/or constraint functions. 


11.8.1 Local optimality conditions 
We consider the problem 
2 = f (x1,x2,..., x04) — min! (max!) 
s.t. g1(x1,X2,...,x,) = 0 


д2(х\,хә, e Xn) = 0 


m Oa. X2, ER Xn) = 0, 
and we assume that m < n. Next, we discuss two basic methods of solving the above 
problem. 
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Optimization by substitution 


If the constraints 
Gi(X1,%2,..-,Xn) = 0, i= 1,2,...,m, 


can be put into its reduced form (according to Theorem 11.6, this means that the Jacobian 
determinant at point x = (x1,x2,...,Xn) is different from zero) 


X1 = W1 (Xm+1;Xm+2, - - -> Xn) 
хэ = W2 (Xm+1,Xm+2» - - -> Xn) 
Xm = Wm(Xm+1,Xm+2, Xn); 


we can substitute the terms for x1, x2, .. . , хт into the given function f. As a result, we get an 
unconstrained problem with n — m variables 


W (Xm+15Xm+2>--+>Xn) — min! (max!), 


which we can already treat. Necessary and sufficient optimality conditions for the case 
n = m + 1 (i.e. function W depends on one variable) have been given in Chapter 4.5 and for 
the case n > т + 1 (function W depends on several variables) in Chapter 11.7. This method 
is known as optimization by substitution. The procedure can also be applied to non-linear 
functions provided that it is possible to eliminate the variables x1, x2,...,Xm. This approach 
is mainly applied in the case when we look for the optimum of a function of two variables 
subject to one constraint (i.e. n = 2, m — 1). 


Lagrange multiplier method 


In what follows, we discuss an alternative general procedure known as the Lagrange multi- 
plier method which works as follows. We introduce a new variable A; for each constraint i 
(1 < i < m), and add all the constraints together with the multipliers to the function to be 
minimized (or maximized). Thus, we obtain: 


L(x, X) = L(x1,x2, э ‚Хп; ^1, À2, жуу »Am) 
m 
=f (x1,%2, t »Xn) + УзА Я 9: 01,22, t Xn). 
i=1 


The above function Г is called the Lagrangian function and the parameters ^; are known as 
Lagrangian multipliers. 


Note that function L depends on n + m variables. The following theorem gives a necessary 
condition for the existence of a local extreme point of the Lagrangian function. 


THEOREM 11.11 (Lagrange's theorem) Let functions f : Dy — R, Dy с R”, and 
gi: Dg, > R, Dg, СОК”, і = 1,2,...,m, m < n, be continuously partially differentiable 
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and let x? = La. aen. 200) € Dy be a local extreme point of function f subject to the 
constraints gj(x1,X2, ...,x4) = 0, i= 1,2,..., m. Moreover, let 79, х9, ^ E # 0. Then 
grad L(x°, 49) = 0. (11.19) 
Condition (11.19) in Theorem 11.11 means that the Lagrangian function L(x;A) has a 
stationary point at a,x ee ‚х0; А, 29, ons 25) and can be written in detail as follows: 
m 0 0 0 
0gi(x1,X5, . -> Xn) | 
Le (x0; A0) = f eee + Уо? yes de "^=(; ў=1,2,...,п, 
i=1 
Laa; 9) 29,00,39,...,3) 20; 12 1,2,...,m. 


The second part ofthe formula above, which requires that the partial derivatives of function L 
with respect to all variables A; must be equal to zero, guarantees that only vectors x are 
obtained which satisfy all constraints. Assume that we have determined the stationary points 
ofthe Lagrangian function by means of Theorem 11.11. We now look for sufficient conditions 
to verify whether a stationary point is indeed an (isolated) local extreme point and if so, 
whether it is a local minimum or a local maximum point. In the case of an unconstrained 
optimization problem, a criterion has already been given, where the Hessian has to be checked. 
As the following theorem shows, in the case of constrained optimization problems, we have 
to check the so-called bordered Hessian which includes the usual Hessian in its lower right 
part as a submatrix. 


THEOREM 11.12 (local sufficient conditions) Let functions f : Dy — К, Dy С К", 
and gi : Dg, > R, Dg, С R”, i = 1,2,...,m, m < n, be twice continuously partially 
differentiable and let (x9; 49) with x? є Dy be a solution of the system grad L(x; A) = 0. 
Moreover, let 


Or xx 0 с iio) 
Qo WS e 20: Essa se s 

H, х;А = mX1 тХп 
10:3 = | жуу му Egan GA) | Ley, OG A) ayy, OA) 
Ly41,05 4) mt Lys СА) Lx, (XA) ST Lyx, (5 A) 


be the bordered Hessian and consider its leading principal minors 1H; (x9; A9)| of the order 
j = 2m + 1,2m + 2,...,n + m at point (х0; 49). Then: 


(1) Ifall leading principal minors IH; (x; A9), 2m -1 <j € n 4- m, have the sign (—1)”, 
then x? = (х?, х а x9) is a local minimum point of function f subject to the constraints 
gi(X) 20,1 = 1,2,...,m. 

If all leading principal minors 1H; (x9; A9), 2m -- 1 <j < n + m, alternate in sign, the 
sign of |H, (х9; A9)| = [Ну (x9; A°)| being that of (—1)", then x? = (Las ds 0) is 
a local maximum point of function f subject to the constraints g;(x) = 0,i = 1,2,...,m. 
If neither the conditions of (1) nor those of (2) are satisfied, then x° is not a local extreme 
point of function f subject to the constraints g;(x) = 0,i = 1,2,...,m. Here the case 


(2 


— 


G 


— 
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when one or several leading principal minors have value zero is not considered a violation 
of condition (1) or (2). 


It is worth noting that the order of the bordered Hessian Hz is n+ m if the problem includes n 
variables x; and m constraints. We illustrate the application of Theorem 11.12 for two special 
cases. Assume first that n = 2 and m = 1, i.e. function f depends on two variables x, y and 
there is one constraint g (x, y) = 0. In that case, we have 2m 4- 1 = 3 and only the determinant 
Hs (x9, 9; 49)| = 1H (x9, y9; А0)| has to be evaluated at the stationary point (х9, y^; А0). If 
|» (x9, 9; А9)| < 0, i.e. the sign of the determinant is equal to (—1)” = (—1)! = —1, then 
(x? 5i is alocal minimum point according to part (1) of Theorem 11.12. If А (х9, у9, A0) > 
0, i.e. the sign of the determinant is equal to (—1)” = (—1)? = 1, then G° y) is a local 
maximum point according to part (2) of Theorem 11.12. However, if |H3 (хо, A9)| — 0, then 
a decision cannot be made on the basis of Theorem 11.12 since, due to the comment at the 
end of part (3), none of the conditions (1), (2) or (3) is satisfied (1.e. neither condition (1) nor 
condition (2) holds, nor does the violation mentioned in (3) occur). 


Assume now that n — 3 and m — 1, i.e. function f depends on three variables and there 
is one constraint g(x) = 0. In this case, the bordered Hessian Hz has order n+ m = 4. 
Moreover, we have 2m + 1 = 3, ie. the leading principal minors |H3(x°;°)| and 
|H4(x°; 49)| = |Hz(x°; ;A9)I have to be саш From Theorem 11.12 we can draw the fol- 
lowing conclusions. If |H4(x°; 2°)| > 0, point x° is not a local extreme point due to part | 
of Theorem 11.12, since the sign of |H4(x°; A°)| is different both from (—1)" = (—1)! = 

and (—1)” = (-1? = —1. 


If |Ha(x9, 9)| < 0 and |H3(x°, 49)| < 0, i.e. both leading principal minors have the sign 
(—1)" = (—1)! = —1, then according to part (1) of Theorem 11.12, x? is a local minimum 
point. 


If |H4(x9; 49)| < 0, but |Æ (х0; 49)| > 0, i.e. both leading principal minors have alternative 
signs, and the sign of |H4(x°; A°)| is equal to (—1)” = (—1)? = —1, then according to part 
(2) of Theorem 11.12, x? is a local maximum point. If Ha (x9; 49)| < 0, but |Æ (x; 2°)| = 0, 
then a decision about an extreme point cannot be made. (Neither condition (1) nor condition 
(2) is satisfied, but according to the remark at the end of part (3), none of the conditions are 
violated.) 


If |H4(x°; А0) | = 0, then a decision about an extreme point cannot be made independently 
of the value of the leading principal minor |H3(x°; 49)| (see remark at the end of part (3) of 
Theorem 11.12). 


Example 11.27 We want to determine all local minima and maxima of functionf : R? > R 
with 


z=f(x,y) = х? +4y – 3 
subject to the constraint 
g(x,y) = 4x? — 16x + y? + 12 = 0. 


We formulate the Lagrangian function depending on the three variables x, y and A, which 
gives 


L(x,y,À) =x? + 4? — 3 + (A — 16x 4- у? + 12). 
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The necessary conditions for a local optimum according to Theorem 11.11 are 


Ly(x, y, A) = 2x + 8Ax — 164 = 0 

Ly (x,y, A) = 8y + 2Ay = 0 

Ly (x,y, à) = AX — 16x - y? +12 = 0. 
Factoring out 2y in equation (11.21), we obtain 

2у. (4+) = 0. 


Thus, we have to consider the following two cases. 


(11.20) 
(11.21) 


(11.22) 


Casel y — 0. Substituting y = 0 into equation (11.22), we get 4x? — 16x + 12 = 0 and 


x! — 4x +3 = 0, 


respectively. This yields 


xy =24+V4-3=3 and x2 =2-V4—-3=1, 


and from equation (11.20) we obtain the corresponding A-values А = —3/4 and ^» = 1/4. 


Hence point 
: 3 
Pi: (х,у) = (3,0); with Aj = = 
and point 
; 1 
P2: (х,у) = (1,0); with Az = A 


are candidates for a local extreme point in this case. 


Case 2 4+). = 0, ie. A = —4. Then we obtain from (11.20) the equation 
2x — 32x + 64 = 0, which has the solution 
_ 64 32 
Хз = 30 7 15 


Substituting the latter into equation (11.22), we obtain the equation 


32\? 32 
4| 5| —16.— + у°+12—=0 
(5) BTE 


which yields 


2 _ 884 
225° 


32 4884 
Рз : (x3,y3) = (5. ——— # ian) ; with Аз = —4 
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and 
32 4884 
Рд : (X3, уд) = (5-5 xR =) 5 with Ад = —4 


as candidates for a local extreme point. 
Next, we check the local sufficient conditions for a local extreme point given in Theo- 
rem 11.12. We first determine 


Lyx (X,Y; A) = 2 + 82 

Ly, ул) = 0 = Lyx (x,y; A) 

Lyy(x,y; A) = 8 + 2А 

Ly, yA) = 8х – 16 = Lax (x,y; A) 

LyX, y; A) = 2y = у(х, у; А). 
Thus, we obtain the bordered Hessian 


0 |&-16 2y 
Н.(х,у;) = | 8x—16| 2+8 0 
2у 0 8 + 24 


Since we have two variables in function f and one constraint, we apply Theorem 11.12 with 
n = 2 and m = 1. This means that we have to check only the leading principal minor for 
j = 2т+ 1 = п+ т = 3, ie. we have to check only the determinant of the bordered 
Hessian to apply Theorem 11.12. We obtain for the four stationary points of the Lagrangian 
function: 


{ 0 8 0 ies 
m (3,0:-3)| = 8 4 0 =65-| 3 МЕСЕ 
0 0 65 
о -8 0 
1 Е 
1 (1.0:7)| = -8 4 0 |=85 ee ral 
0 0 85 
= 8.5. (—64) <0, 
Р 16 2/884 
15 125 
32 ./884 16 
н {2° У°°®._А\\-| 2 ар 10 0 
(S i5 ) 15 д 
24/884 
i 0 0 
15 
А 16 25/884 
15 15 
32 /884 16 
И ыле лы ОО 0 > 0. 
15° 15 15 
24/884 
-2 0 0 
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Hence, due to part (1) of Theorem 11.12, the points (3,0) and (1,0) are local minimum 
points, since in both cases the sign of the determinant of the bordered Hessian is equal to 
(—1)” = (—1)! = —1, and due to part (2) of Theorem 11.12, the points (32/15, 884/15) 
and (32/15, —/884 /15) are local maximum points, since the sign of the determinant of the 
bordered Hessian is equal to (—1)" = (—1)? = +1. Notice that in this problem the check of 
the conditions of Theorem 11.12 reduces to the determination of the sign of the determinant of 
the bordered Hessian (however, in the general case of an unconstrained optimization problem 
we might have to check the signs of several leading principal minors in order to verify a local 
extreme point). 

Some level curves of function f together with the constraint g(x,y) = 0 are given in 
Figure 11.9. To illustrate, we consider equation f (x, y) = 0 corresponding to x? + 45 = 3 
which can be rewritten as 


(a) +(5) =" 


The latter is the equation of an ellipse with the half-axes a = 4/3 and b = 4/3/2. If we 
consider now f (x,y) = C with C Æ 0, we also get ellipses with different half-axes, where 
the half-axes are larger the bigger the value of C is. The constraint g(x,y) = 0 can be 
rewritten as 


A(x? — 4x) +y? + 12 = 402 — 4x +4) y? -4=0 


which also corresponds to the equation of an ellipse: 


(x — 2)? + 0) =e 


Sov 


Figure 11.9 Level curves for function f with f (x, y) = x? + 4y? —3; 


From Figure 11.9 one can confirm that zı = 6 and z2 = —2 are indeed the local minima and 
Z3 = 24 = 3885/225 ~ 17.3 are the local maxima of function f subject to the given constraint. 
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Example 11.28 А utility function assigns to each point (vector) х = (x1,x5, .. . , Xn), where 
the ith component x; describes the amount ofthe ith commodity (i= 1,2, ..., п; sucha vector 
is also called a commodity bundle), a real number u(x1, x2, . . . , Un) which measures the degree 


of satisfaction or utility of the consumer with the given commodity bundle. Assume that a 
Cobb-Douglas utility function и : R2 — R+ depending on the amounts of two commodities 
with 

ux, y) = х?у? 
is given. We wish to maximize this utility function subject to the constraint 


7x + 5y = 105. 


The equation 7x + 5у = 105 describes the budget constraint. It is convenient to take 
logarithms of function u and work with equation 


In u(x, y) = 21nx + 3lny. 
This is possible since, due to the strict monotonicity of the logarithmic function, it has no 
influence on the optimal solution whether we minimize function и or function In и. Thus, we 
have to solve the following constrained optimization problem: 

f(x,y) 221Inx - 3lny — max! 
subject to 

7x + 5у = 105. 
We consider two approaches to solving this problem. 
The first way is to apply optimization by substitution, which means that we have to substitute 


the budget constraint into the function of the maximization problem from the beginning. 
This yields 


7 
у= wx) = 21 — =x. 
5 
If we do this, the problem is now as follows: 
7 
W(x) = 21ах + 31а { 21 — s — max! 


The necessary optimality condition of first order for this problem is as follows (see 
Chapter 4.5, Theorem 4.12): 


5 uM 
105—7x 5 


; 2 
№ (х) = 3 
x 
which can be written as 


Wen re ж 
x 105 – 7х 
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The latter equation yields the solution x? = 6. Substituting this back into the budget constraint, 
we get 


7 
y 221-6.6-126. 


To check the sufficient condition given in Chapter 4.5 (see Theorem 4.14), we determine 


W" (x) Y ai! 
х) = – LEE et 
x (105 — 12 
Thus, we obtain for x? = 6 
W" (6) 2 EH 0.0926 < 0 
= 2 &-0. < 0, 
36 632 


and we can conclude that x? = 6 is a local maximum point of function W. 
The second way to solve this example is to apply Lagrange’s multiplier method. We set up 
the Lagrangian function L with 


L(x,y; A) = 21nx + 3Iny + A(7x + 5y — 105) 


and differentiate it to get the three necessary optimality conditions of the first order according 
to Theorem 11.11: 


2 

L,(x,yj;A) = = +71 = 0 (11.23) 
x 
3 

Ly(x,y;4) = > + 5А = 0 (11.24) 
у 

Г (х,у,А) = 7х + 5y — 105 = 0. (11.25) 


Let us solve this system first for А and then for х and y. After cross-multiplying, we get from 
equations (11.23) and (11.24) 


ТАх = —2 and 5Ay = —3. 


We now add these two equations and use equation (11.25): 
—5 = Ax + Sy) = 1054 


which gives 


A Esc. 
21° 
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Just as before, substituting this back into the equations (11.23) and (11.24) and solving them 
for x and y, we get 


х9 = 6 and у9=12.6. 


To check the sufficient conditions given in Theorem 11.12, we determine 


Lyx (x,¥5 A) = -5 
Ly, y; 4) = 0 = Ly (x,y; A) 
Ly y;À)- E 

y 


LyX, ¥34) = 7 = Lx, yi) 
Lya (Q5 y; A) = > = Liy(x,y; A), 


and obtain the bordered Hessian: 


0| 7 5 
2 
Н(х,у;) = | 7 E P 
3 
5| 0 -5 
y 


Since n = 2 and m = 1, according to Theorem 11.12 we have to check only one leading 
principal minor for j = 2m + 1 = 3 which corresponds to the determinant of the bordered 
Hessian. For point (x°, y9; 4°) = (6, 12.6; —1/21), we get 


0| 7 5 

вале 0 
L , U' 91 18 $ 
5 — 055 

12.62 


Notice that we do not have to calculate the values of both determinants above, we only 
need the sign of the left-hand-side determinant. Since the first summand obviously has a 
value greater than zero (the first determinant has a negative value multiplied by a negative 
factor) and the second determinant also has a value greater than zero, the determinant of the 
bordered Hessian is greater than zero. According to part (2) of Theorem 11.12, we find that 
P9: (x0, y9) = (6, 12.6) is a local maximum point since the sign of the determinant of the 
bordered Hessian is equal to (C 1)" = (—1)* = +1. 
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11.8.2 Global optimality conditions 


Next, we present a sufficient condition for a global maximum and a global minimum point. 


THEOREM 11.13 (global sufficient conditions) Let functions f : Dy > R, Dy С R”, 
and gi : Dg, > R, Dg, С R”, i = 1,2,...,m, m < n, be twice continuously partially 
differentiable and let (x9; A?) with x? є Dy be a solution of the system grad L(x; A) = 0. 
If the Hessian 


Li QAM) o Lex 3A) 
Н(х; 59) = : | 
Lyx) (x; A9) PARS Lxx, (x; A9) 


(1) is negative definite for all points x € Dy, then х0 is the global maximum point of function 
f subject to the constraints g;(x) = 0,i = 1,2,...,m; 

(2) is positive definite for all points x € Dy, then x is the global minimum point of function 
f subject to the constraints g;(x) = 0,i = 1,2,...,m. 


Theorem 11.13 requires us to check whether the Hessian is positive or negative definite for 
all points х € Dy, which can be rather complicated or even impossible. If function f and the 
constraints g; contain only quadratic and linear terms, the second-order partial derivatives are 
constant, and evaluation of ће determinant of H (x, А0) is possible. We look at the following 
example. 


Example 11.29 Given is the function f : Dy > R, Dy © ЕЗ, with 
f(x,y,z) = 2x + 2y — 22 
S.t. 
x? + 2y? + 32? = 66. 


We determine all global optimum points by applying Theorems 11.11 and 11.13. Setting up 
the Lagrangian function, we get 


L(x,y,z; A) = 2x + 2y — 2z + A + 2y? + 322 — 66). 


Applying Theorem 11.11, we get the necessary optimality conditions 


Ly =2+2Ax = 0 (11.26) 
Ly =2+4у=0 (11.27) 
І, = —2+ 6\2 = 0 (11.28) 
І = х? +2y* + 32? — 66 = 0. (11.29) 


From equations (11.26) to (11.28), we obtain 
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Substituting the above terms for x, y and z into equation (11.29), we obtain 


1 1 
T3. = 66 


1 
2. 
Е 442 942 


2 


x? + 2у? + 32? = 
which can be rewritten as 

1 just o1 П _ 6 

A2 2 3) › 67 


This yields 


jm 
36 
which gives the two solutions 


à=- and à2 = == 


Using equations (11.26) to (11.28), we obtain the following two stationary points of the 
Lagrangian function: 


1 

Pi: x =-—-6, yı = —3, AEPA with АЛ = 6 
and 

А 1 

P2: x2 =6, y2 = 3, z = —2; with Mies. 


We now check the sufficient conditions for a global optimum point given in Theorem 11.13. 
For point P1, we obtain 


1 
м 0 0 0 0 
Н (х,у,2; А) = 0 44 O = 0 2 0 
0 0 6A; 3 
0 0 1 


(Notice that the above matrix does not depend on variables x, y and z since the mixed second- 
order partial derivatives are constant.) All leading principal minors have a positive sign and by 
Theorem 10.7, matrix Н (x, y, z; 41) is positive definite. Therefore, Рі is the global minimum 
point. 


м 0 0 =з. 0 0 
A(x, y,2342) = 0 44 0 = _2 0 
0 0 6A 3 
0 = 


The leading principal minors alternate in sign beginning with a negative sign of the first 
leading principal minor. Therefore, matrix Н (x, у, 2; A2) is negative definite, and P; is the 
global maximum point. 
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11.9 DOUBLE INTEGRALS 


In this section, we extend the notion of the definite integral to functions of more than one 
variable. Integrals of functions of more than one variable have to be evaluated in probability 
theory and statistics in connection with multi-dimensional probability distributions. In the 
following, we restrict ourselves to continuous functions f depending on two independent 
variables. 


Definition 11.13 The integrals 


b y2(x) b 
1 =| [tone de | вов 
а Ху) а 
апа 
d [ x2) d 
һ= || | rana] a= | һо 
с V) с 


are called (iterated) double integrals. 


For simplicity, the parentheses for the inner integral are often dropped. To evaluate 
integral Г, we first determine 


y2(x) 
Р(х) = / f (x,y) dy, 
yi(x) 


where x is (in the same way as for partial differentiation) treated as a constant, and then we 
evaluate the integral 


b 


h= [roa 


a 


as in the case of integrating a function of one variable. To evaluate both integrals, we can 
apply the rules presented in Chapter 5. 


Example 11.30 We evaluate the integral 


1= | [299 
R 
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where А contains all points (x, y) with х2 < y € 3x and 0 < x < 1 (see Figure 11.10). 
We obtain 


1 
= | [x x) = x. e dx — [or — x?) dx 
0 


0 
9 
= LALE 
4 6 


1 25 
ar ae SED 


Figure 11.10 Region R considered in Example 11.30. 


Finally, we consider the case where both the lower and the upper limits of integration of both 
variables are constant. In this case, we get the following property. 


THEOREM 11.14 Let functionf : Dy > R, Dr € R2, be continuous in the rectangle given 
bya <x < Бапіс x y < d. Then 


b d d b 


| [ron d = | [rena d 


a с c a 
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One application of double integrals is the determination of the volume of a solid bounded 
above by the surface z = f (х, y), below by the xy plane, and on the sides by the vertical walls 
defined by the region R. If we have f (x, y) > 0 over a region R, then the volume of the solid 
under the graph of function f and over the region R is given by 


- || f(x,y) dx dy. 
R 


Example 11.31 We want to find the volume V under the graph of function f with 
Гоу) = 23 + у? 
and over the rectangular region given by 0 < x < 2 апа 0 < у < 1. We obtain: 


2 


1 
y= || оу ов | [e +» dx 
R 0 


0 
2 3 1 
= f (5) dx 
3 0 
0 
f 1 2x3 ? i6 2 18 
X X 
= 2x?- 2| de= = ==: 
f G1) i (2+3) 7 


yo 3^3 3 
0 


Thus, the volume of the solid described above is equal to six volume units. 


The considerations in this section can be extended to the case of integrals depending on more 
than two variables. 


EXERCISES 
11.1 (a) Given is a Cobb-Douglas function f : R2 — R with 
z =f y) =A: xy, 
where A = 1,0, = 1/2 and o; = 1/2. Graph isoquants for z = 1 and z = 2 
and illustrate the surface in R?. 


(b) Given are the following functions f : Dy > R, Df © R?, with z = f (x, y): 


(i) z= /9-—x?2-y?; (ii) z= у; (iii) z —x?- 4x + 2y. 


Graph the domain of the function and isoquants for z = 1 andz = 2. 


11.2 


11.3 


11.4 


11.5 


11.6 


11.7 


11.9 


Functions of several variables 439 


Find the first-order partial derivatives for each of the following functions: 
(а) 2= (х,у) =» sin’ y; (b 2= 70у) = х0); 
(с) 2= (х,у) = х +y; (d 2= (х,у) =In(./x/y); 


2 2 2 
(e) z = f(x1,x2,x3) eae eS (f) z=f@1,x2,x3) = fx? +35 + х2. 


The variable production cost C of two products P, and P» depends on the outputs x 
and y as follows: 


1, 200,000 32, 000, 000 
C (x, y) = 120x + ———— —— + 800y + —————, 
x y 


where (x, y) € R? with x є [20,200] and y € [50, 400]. 


(a) Determine the marginal production cost of products P, and P2. 
(b) Compare the marginal cost of Р for xy = 80 and x2 = 120 and of P2 for 
y1 — 160 and y? — 240. Give an interpretation of the results. 


Find all second-order partial derivatives for each of the following functions: 


(а) z= f(x1,x2,x3) = x + Suites + 2x2 + In(x1x3); 

1 + 
O z—fe»- 1 © z=f@y) =n. 
Determine the gradient of function f : Dy —> R,Dy € ui with z = f(x,y) and 
specify it at the points (xo, yo) = (1,0) and (х,у) = (1,2): 


(a) z= ах + by; (b 2= 2х2 + ху? + siny; (с) 2= 4/9 – х2 – у2. 


Given is the surface 
z =f (x,y) = x^ sin? y 


and the domain Dy = IR2, where the xy plane is horizontal. Assume that a ball is 
located on the surface at point (x,y,z) = (1, 1,z). If the ball begins to roll, what is 
the direction of its movement? 


Determine the total Чисто for the following functions: 
(à z=f@y)=siny; (b+) z=f(,y) =x + ху? + siny; 


(с) z=fa%y=e"™, (d z=f(,y) my. 

Find the surface of a circular cylinder with radius r = 2 m and height h = 5 m. 
Assume that measurements of radius and height may change as follows: r = 2+0.05 
and h = 50.10. Use the total differential for an approximation of the change of the 
surface in this case. Find the absolute and relative (percentage) error of the surface. 


Letf:Dy— R, Df C R2, be a function with 


2 
2 = f (x1,x2) = хте, 


where х1 = х1 (t) and x? = x2 (t). 
(a) Find the derivative dz/dt. 


(b) Use the chain rule to find z/(£) if (i) xi = 2; x? = In’; (ii) x = In£?; 
X2 = g 
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11.10 


11.11 


11.12 


11.13 


11.14 


11.15 


(c) Find z'(f) by substituting the functions of (i) and (ii) for xı and x2, and then 
differentiate them. 


Given is the function 


z foy) = ү9—х? —у?. 


Find the directional derivatives in direction r! = (1,0)',r? = (1,DT,P = 
(—1, —2)T at point (1, 2). 
Assume that 


C (x1, x2, x3) = 20 + 2x1x2 + 8x3 + x2 In x3 + 4x1 


is the total cost function of three products, where x, x5, хз are the outputs of these 
three products. 


(a) Find the gradient and the directional derivative with the directional vector 
r = (1,2,3)! of function C at point (3,2, 1). Compare the growth of the cost 
(marginal cost) in the direction of fastest growth with the directional marginal 
cost in the direction r. Find the percentage rate of cost reduction at point (3, 2, 1). 

(b) The owner of the firm wants to increase the output by 6 units altogether. The 
owner can do it in the ratio of 1:2:3 or of 3:2:1 for the products х] : x2 : 23. 
Further conditions are x; > 1, хз > 1, and the output x2 must be at least 4 units. 
Which ratio leads to a lower cost for the firm? 


Success of sales z for a product depends on a promotion campaign in two media. Let 
xı and x2 be the funds invested in the two media. Then the following function is to 
be used to reflect the relationship: 


z = f (x1,x2) = 10 /x1 + 20 (0 + 1) + 50; xı > 0,x5 > 0. 


Find the partial rates of change and the partial elasticities of function f at point 
(x1, x2) = (100, 150). 

Determine whether the following function is homogeneous. If it is, what is the degree 
of homogeneity? Use Euler's theorem and interpret the result. 


(a) z=f(x1,x2) = AT + 2x1 x5 + x3; (b) z=f(%1,x2) = х\х$ +х]. 


Let F(x, у) = 0 be an implicitly defined function. Find dy/dx by the implicit-function 
rule. 


2 2 

(а) FQy25- Tz -1=0 (у> 0); (Ы) F@,y)= xy- sin3x = 0; 
а 

(с) F(x,y) =x” —Inxy+x*y = 0. 

We consider the representation of variables х and у by so-called polar coordinates: 


x —rcosq, y=rsing, 


11.16 


11.17 


11.18 


11.19 


11.20 


11.21 


11.22 
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or equivalently, the implicitly given system 


F\(x,y37, p) —rcosQ -x= 0 
ЕК›(х,у;т,ф) =rsing — y = 0. 
Check by means of the Jacobian determinant whether this system can be put into its 


reduced form, i.e. whether variables r and ф can be expressed in terms of x and y. 
Check whether the following function f : Dy > R, Df € R?, with 


z S fy) = xXy!(0 — x — y) 


has a local maximum at point (x1, у) = (1/2, 1/3) and a local minimum at point 
(2, y2) = (1/7, 1/7). 

Find the local extrema of the following functions f : Ру > R, ру с R?: 

(а) z=f@y=xryt+y—-y; (b+) z=f (x,y) = х2у – 2ху + je. 

The variable cost of two products P, and P2 depends on the production outputs х 
and y as follows: 


32, 000, 000 
+ Еа 


1,200,000 
C(x,y) = 120x + = + 
x 


800y 


B 


where Dc — R2. Determine the outputs x? and у? which minimize the cost function 
and determine the minimum cost. 


Given is the function f : R? — R with 

РО, у,2) = xi - 2x + y? — 2z3y + 327. 
Find all stationary points and check whether they are local extreme points. 
Find the local extrema of function C : Dc > R, Dc € R2, with 


C(x) = C(xi,x2,xa) = 20 + 2x1xo + 8x3 + x? In x3 + 4x1 


and x € R?, хз > 0. 


The profit P of a firm depends on three positive input factors х, x2, хз as follows: 
P(x1,x2,x3) = 90x1x2 — xix) — xix + 601пхз — 4x3. 
Determine input factors which maximize the profit function and find the maximum 


profit. 
Sales y of a firm depend on the expenses x of advertising. The following values x; of 
expenses and corresponding sales y; of the last 10 months are known: 


xi | 20 20 24 25 26 28 30 30 33 34 
yi | 180 160 200 250 220 250 250 310 330 280 


(a) Find a linear function f by applying the criterion of minimizing the sum of the 
squared differences between y; and f (x;). 
(b) What sales can be expected if x = 18, and if x = 36? 
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11.23 


11.24 


11.25 


11.26 


11.27 


11.28 


11.29 


Given the implicitly defined functions 


259 _ 2\2 
hy co D m 2 1=0, 


verify that F has local extrema at points Р! : (x = 1,y = 5) and Рз : (x = 1,y = 
—1). Decide whether they are a local maximum or a local minimum point and graph 
the function. 


Find the constrained optima of the following functions: 


(а) z=f(x,y) =x? + xy - 2y?, 
s.t. 


2х +y = 8; 


(b) z =f (x1, %2,%3) = 3x2 + 2x3 — xix, 
S.t. 


х\+х =3 and 2хухз=5. 
Check whether the function f : Dy > R, ру С R3, with 
z =f (x,y,z) =x? —xz +y?  ylz – 22, 
s.t. 
x—-y)—z—-220 and x+z-4=0 


has a local extremum at point (11, —4, —7). 


Find the dimensions of a box for washing powder with a double bottom so that the 
surface is minimal and the volume amounts to 3,000 cm?. How much cardboard is 
required if glued areas are not considered? 


Find all the points (x, y) of the ellipse 4x? + y? = 4 which have a minimal distance 
from point Ро = (2,0). (Use the Lagrange multiplier method first, then substitute 
the constraint into the objective function.) 


The cost function C : RÌ — R of a firm producing the quantities x1, x2 and хз is 
given by 


С(х1,х2,х3) = x + 2x2 + 2x3 — 2x41x2 — 3x1x3 + 500. 
The firm has to fulfil the constraint 
2x1 + 4x2 + 3x3 = 125. 


Find the minimum cost. (Use the Langrange multiplier method as well as optimization 
by substitution.) 


Evaluate 


f (2x+y+1)dR, 
R 


11.30 


11.31 


Functions of several variables 443 
where the region А is the triangle with the corners Р] : (1, 1), Рә : (5,3) апар; : 
(5, 5). 


Find the volume of a solid of constant height h — 2 which is bounded below by the 
xy plane, and the sides by the vertical walls defined by the region R: R is the set of 
all points (x, y) of the xy plane which are enclosed by the curves y? = 2x + 1 and 
х-у-1= 0. 

Evaluate the double integral 


I Е dx dy, 
X 
R 
2 


where R is the region between the two parabolas x = y? and y = х2. 


12 Differential equations and 
difference equations 


We start our considerations with the following introductory example. 


Example 12.1 We wish to determine all functions y = у(х) having the rate of change 
3./2x + 1. Using the definition of the rate of change (see Chapter 4.4), this problem can be 
formulated as follows. Determine all functions y = y(x) which satisfy the equation 


oy(x) =” КОЛИ И стти 
у(х) 


The latter equation can be rewritten as 


у(х) = 3V 2x + 1- у(х). 


This is a relationship between the independent variable x, the function y(x) and its derivative 
y (x). Such an equation is called a differential equation. 


In this chapter, we discuss some methods for solving special types of differential equations 
(and later related difference equations). In what follows, as well as у(х), y' (x), . . . we also use 
the short forms y, y’,... First, we formally define a so-called ordinary differential equation. 


Definition 12.1 A relationship 


Е(х,у,у',у',...,у?) =0 (12.1) 


between the independent variable x, a function у(х) and its derivatives is called an 
ordinary differential equation. The order of the differential equation is determined by 
the highest order of the derivatives appearing in the differential equation. 


If representation (12.1) can be solved for y, we get the explicit representation 


y — f (xy, y, y" iym D) 


Differential and difference equations 445 


of a differential equation of order n. The notion of an ordinary differential equation indicates 
that only functions depending on one variable can be involved, in contrast to partial differen- 
tial equations which include also functions of several variables and their partial derivatives. 
Hereafter, we consider only ordinary differential equations and for brevity, we skip ‘ordinary’. 


Definition 12.2 А function y(x) for which relationship F(x,y,y’,y’,...,y™) = 0 
holds for all values x € D, is called a solution of the differential equation. The set 


S = (УО) FG yy. uy) =0 forallx € Dy} 


is called the set of solutions or general solution of the differential equation. 


12.1 DIFFERENTIAL EQUATIONS OF THE FIRST ORDER 


A first-order differential equation can be written in implicit form: 
F(x,y,y) = 0 
ог in explicit form: 


y =f (x,y). 


In the rest of this section, we first discuss the graphical solution of a differental equation of the 
first order and then a special case of such an equation which can be solved using integration. 


12.1.4 Graphical solution 


Consider the differential equation у = f (x, y). At any point (xo, yo), the value y' = f (хо, yo) 
of the derivative is given, which corresponds to the slope of the tangent at point (xo, yo). 
This means that we can roughly approximate the function in a rather small neighbourhood 
by the tangent line. To get an overview on the solution, we can determine the curves, along 
which the derivative y’ has the same value (i.e. we can consider the isoquants with y’ being 
constant). In this way, we can 'approximately' graph the (infinitely many) solutions of 
the given differential equation by drawing the so-called direction field. This procedure is 
illustrated in Figure 12.1 for the differential equation у = x/2. 

For any x € R (and arbitrary y), the slope of the tangent line is equal to x/2, i.e. for x = 1, 
we have y' — 1/2 (that is, along the line x — 1 we have the slope 1/2 of the tangent line), 
for x = 2, we have у = 1 (i.e. along the line x = 2 the slope of the tangent line is equal to 1) 
and so on. Similarly, for x = —1, we get y = —1/2, forx = —2 we get y' = —1 and so on. 
Looking at many x values, we get a rough impression of the solution. In the given case, we 
get parabolas (in Figure 12.1 the parabolas going through the points (0, —2), (0, 0) and (0,2) 
are drawn), and the general solution is given by 

M 
у= 4 +C 


with C being an arbitrary real constant. 
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Figure 12.1 The direction field for у = x/2. 


Example 12.2 We apply this graphical approach to the differential equation 


Figure 12.2 The direction field for y' = —x/y. 
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For each point on the line y = x, we have y’ = —1, and for each point on the line y = —x, 
we have у = 1. Analogously, for each point on the line y = x/2, we get у = —2 and for 
each point on the line y = —x/2, we get y’ = 2. Continuing in this way, we get the direction 
field given in Figure 12.2. Thus, the solutions are circles with the origin of the coordinate 
system as the centre, i.e. 


x+y =C (C20). 


(In Figure 12.2 the circles with radius 2, i.e. C = 4, and radius 4, i.e. C = 16, are given.) 
Implicit differentiation of y? = C — x? confirms the result: 


2y. y' = —2x 
у х 
MeL 

y 


12.1.2 Separable differential equations 


We continue with such a type of differential equation, where a solution can be found by 
means of integration. Let us consider the following special case of a first-order differential 
equation: 


y =/(х,у) = g(x) - h), 


i.e. function f (x, y) can be written as the product of a function g depending only on variable x 
and a function / depending only on variable y. In this case, function f = f (x,y) has a special 
structure, and we say that this 1s a differential equation with separable variables. Solving 
this type of differential equation requires only integration techniques. First, we rewrite 


d 
y = = = g(x) ho) 
X 


in the form 


dy 


hy) = g(x) dx. 


That is, all terms containing function y are now on the left-hand side and all terms containing 
variable x are on the right-hand side (i.e. we have separated the variables). Then we integrate 
both sides 


dy — 
IPFE 


and obtain 


HO) = Gx) + C. 
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If we eliminate y, we get the general solution y(x) including constant C. A particular (or 
definite) solution y" (x) is obtained when C is assigned a particular value by considering an 
initial condition у(хо) = yo. The latter problem of finding a particular solution is also referred 
to as an initial value problem. We illustrate this approach by the following two examples. 


Example 12.1 (continued) If we rewrite equation 


d 
A = у MEET. 
lx 


in the form 
d 
E es 3./2x + 1 dx, 
y 


we get a differential equation with separable variables. Let us integrate both sides: 
dy 


a 3 | x 1 dx. (12.2) 
y 


Now we get on the left-hand side the natural logarithm function as an antiderivative. 
Considering the integral on the right-hand side, we apply the substitution 


u=2x+1 


which yields 


du, 
dx — 


Now we can find the integral: 
1 1 
[та = ; | мааи SW ee 


Therefore, equation (12.2) leads to 
In |у = (2x + 1)3/2 + С". 
Thus, we get the general solution 


3/24 с* 3/2 * 
lvl = eO D TC = orth) -ef 


which can be rewritten as 
3/2 
pl=C- eD, — c.g 


Here we have written the term e°” as a new constant C > 0. By dropping the absolute values 
(they are now not necessary since the logarithmic terms have been removed), we can rewrite 
the solution obtained as 


уб) = C. eD 
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with C being an arbitrary constant. If now the initial condition y(0) = e is also given, we 
obtain from the general solution 


е= С. el, 
i.e. C = 1. This yields the particular solution 
yx) = ery”? 


of the initial value problem. 


Example 12.3 We wish to determine all functions y = y(x) whose elasticity is given by 
the function 


€y(x) = 2x + 1. 
Applying the definition of elasticity (see Chapter 4.4), we obtain 


у(х) 
УЕ 


We can rewrite this equation as 


d 
d 2x 
y x 


or, equivalently, 


d 
? = (242) a 
y x 


This is a differential equation with separable variables and we can integrate the functions on 
both sides. This yields: 


е-е) 


In [у| = 2x + In |x| + C* 


* ж 
2x+ln |х|+С* __ 2x. e” |x| . eC 


у| =е е 


ly] = e” -|x|-C, C>0. 
By dropping the absolute values, we can rewrite the solution obtained as 
у(х) = C-x-e* 


with C being an arbitrary real constant. 


450 Differential and difference equations 


Finally, we mention that we can reduce certain types of differential equations by an appropriate 
substitution to a differential equation with separable variables. This can be done e.g. for a 
differential equation of the type: 


у = g(ax + by + с). 


We illustrate this procedure by an example. 


Example 12.4 Let the differential equation 
/ 2 
y = (9x +у+2) 


be given. (In this form it is not a differential equation with separable variables.) Applying 
the substitution 


u = 9x +y +2, 
we get by differentiation 


du 9 dy 
dx /— dx 


which can be rewritten as 


After substituting the corresponding terms, we get 


LCS 


dx 


This is a differential equation with separable variables, and we get 


du 


——— = dx. 
u?--9 X 


Integrating both sides, we obtain 
І actam ус 
~-arctan- = х 
3 3 

or, correspondingly, 

u 
arctan = 3(x + С) 

и 

3 = tan 3(x + C) 

и = 3tan3(x + C). 
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After substituting back, we get 


9x + y +2 = 3tan3(x + C) 
y = 3tan3(x + C) — 9х ~ 2. 


12.2 LINEAR DIFFERENTIAL EQUATIONS OF ORDER п 


Definition 12.3 A differential equation 
y? dL asy sb Tecra (x) y! + ао(х) y = q(x) (12.3) 


is called a linear differential equation of order n. 


Equation (12.3) is called linear since function y and all derivatives occur only in the first 
power and there are no products of the n+ 1 functions y, y’,...,y). The functions a;(x),i = 
1,2,...,n — 1, depend on variable x and can be arbitrary. 


Definition 12.4 If q(x) = 0 in equation (12.3), i.e. function q = q(x) is identically 
equal to zero for all values x, the differential equation (12.3) is called homogeneous. 
Otherwise, i.e. if q(x) Æ 0, the differential equation is called non-homogeneous. 


Function q — q(x) is also known as the forcing term. Next, we present some properties of 
solutions of a linear differential equation of order n and then we discuss an important special 
case. 


12.2.1 Properties of solutions 


Homogeneous differential equations 


First, we present some properties of solutions of linear homogeneous differential equations 
of order n. 


THEOREM 12.1 Let yj (х), y2(x), ..., ym (x) be solutions of the linear homogeneous differ- 
ential equation 


yP + dni Q)y "7 +--+ + ag()y = 0. 124) 
Then the linear combination 
уб) = Су yi) + Cayo) +--+ + Cm Vm (x) 


is a solution as well with C1, C2,..., Cm € R. 
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Definition 12.5 The solutions уу (x), y2(x),...,¥m(x), m x n, of the linear homoge- 
neous differential equation (12.4) are said to be linearly independent if 


Ci yi (x) + C2y2(X) +--+ + Cn ym(x) = 0 forall x € D, 


is possible only for С = C2 = --- = Cm = 0. Otherwise, the solutions are said to be 
linearly dependent. 


The next theorem gives a criterion to decide whether a set of solutions for the homogeneous 
differential equation is linearly independent. 


THEOREM 12.2 The solutions у (х), у2(х),...,ут(х), m < n, of the linear homogeneous 
differential equation (12.4) are linearly independent if and only if 


ло) рюб)  .. mE) 
о) X0 .. уһ) 

W(x) = #0 forxeD, 
у" (ху pe Р(х)... уш Рф 


W (x) is called Wronski 5 determinant. It can be proved that, if W (хо) 4 0 for some xo € Dy = 
(a, b), then we also have W(x) 4 0 for all x € (a, b). Under the assumptions of Theorem 
12.2, it is therefore sufficient to investigate W (xo) for some particular point xo € Dy. If we 
have W (xo) 4 0, the m solutions y (х), уг (x), .. ., y (x) are linearly independent. 


Example 12.5 We check whether the functions 


1 Inx 
yi) =x, у(х) = = and y3(x) = — 
x x 


are linearly independent solutions of the differential equation 


/// 


xy + 4x2 y" Txy – у = 0. 


We first determine the required derivatives of the function y;,i є {1,2, 3}, and obtain 


уб) = 1, у(х) = 0, у(х) = 0, 
1 2 6 
yŒ) = EV. y? (х) = x уу (x) = x^ 
1—Inx —34+2Inx 11 — 6х 
уз (х) = ee уҙ (x) = - IS xd у (x) zx E x 
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Next, we check whether each у;,і є {1,2,3}, is a solution of the differential equation. 
We get 


yi):x-0x4x Ох = 0, 
2 2 DO о, Ж d 
ОКЕ (-3) i dd (3) += ( ;) E ЕГ 
х х " $ эла: 
уз(х):х?. Oy Li —3+4+2Inx 1-Шшх ах 
xt x x 5 


11—6Inx —12+8 шх 1 lnx х 
= + + 


x x x x x 


We now investigate Wronski’s determinant W (x): 


1 Inx 
x 2 т 
x 
1 1— lnx 
№ (х) = |1 E E 
2 —3 + 2lnx 
E = 
1 1—Inx 1 Inx 
2 х2 х2 «x x 
> 2 —3-42lnx 2 —3+2Inx 
х3 х3 х3 х3 
_ 3—2lnx—2+2lnx —3+2Inx—2Inx 4 
л х? x4 оа 


Thus, Wronski's determinant is different from zero for all x € Dy = (0,00). Therefore, the 
solutions y; (x), y; (x) and уз (х) are linearly independent. 


Now we can describe the general solution of a linear homogeneous differential equation of 
order n as follows. 


THEOREM 12.3 Let у(х), y2(x),...,Yn(x) be n linearly independent solutions of a lin- 
ear homogeneous differential equation (12.4) of order n. Then the general solution can be 
written as 


Sg = p" (х) | у" б) = Ci yE) + Coyne) +- + C ys), Сі, C2,- .-, C. € R}. 
(12.5) 


The solutions у (x), уз (х), . .., y4 (x) in equation (12.5) constitute a fundamental system of 
the differential equation, and we also say that y” (x) is the complementary function. Referring 
to Example 12.5, the general solution of the given homogeneous differential equation is 


1 Inx 
уН (x) = Cix + €): - + C4: —. 
x x 
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Non-homogeneous differential equations 


We present the following theorem that describes the structure of the general solution of a 
linear non-homogeneous differential equation. 


THEOREM 12.4 Let 
8н = o (х) | у" (к) = Cry) + С» ya) +--+ Cu yn, Сі, Co... C. € R} 


be the general solution of the homogeneous equation (12.4) of order n and y" (x) a particular 
solution of the non-homogeneous equation (12.3). Then the general solution of a linear 
non-homogeneous differential equation (12.3) can be written as 


S= уо) ly) 2» " @)+у' (х), у" О) e Su) 
= (у(х) |yG) = CiyiG) + Соух) +... + Cryn +" (х), Ci, Co. .., Cs € R}. 


Initial value problems 


If an initial value problem of the linear differential equation of order n is considered, n initial 
conditions 


yGo-2y» убо) =y, e. y" UG = уі (12.6) 


are given from which the n constants C1, C5, . . . , C, ofthe general solution can be determined. 
This yields the particular solution y" (x). The following theorem gives a sufficient condition 
for the existence and uniqueness of a particular solution. 


THEOREM 12.5 (existence and uniqueness of a solution) Let n initial conditions accord- 
ing to (12.6) for a linear differential equation of order n 


y? + a, 4y"7D +- ау + ay = 40) (12.7) 


with constant coefficients aj, i є {0, 1,...,п — 1}, be given. Then this linear differential 
equation has exactly one particular solution y^ (x). 


We only note that for arbitrary functions а; (х), i € (1,2,...,n— 1}, an initial value problem 
does not necessarily have a solution. 


12.2.2 Differential equations with constant coefficients 


Now, we consider the special case of a linear differential equation of order n with constant 
coefficients, i.e. aj(x) = aj fori = 0,1,2,...,n— 1. 


Homogeneous equations 


First, we consider the homogeneous differential equation. Thus, we have 


yo an yr) +- +a y + ау = 0. (12.8) 
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This type of homogeneous differential equation can be easily solved. We set 
у(х) = е with à € C 

and obtain 
у(х) = Ае, y" (x) = А2 e, ee y? (x) = ate, 


Substituting the latter terms into equation (12.8) and taking into account that the exponential 
function e^* has no zeroes, we obtain the so-called characteristic equation: 


P O) = M a, åA"! eese Bap Ad ao = 0. (12.9) 


We know from Chapter 3 that polynomial P; (A) has n (real or complex) zeroes А, 2, . . ., Ag. 
For each root A; of the characteristic equation, we determine a specific solution of the 
homogeneous differential equation (Table 12.1). 


It can be shown by Theorem 12.2 that the resulting set of solutions forms a fundamental 
system, i.e. the n solutions obtained in this way are linearly independent. It is worth noting 
that also in the case of complex roots of the characteristic equation, the solutions used are 
real. The solutions in cases (c) and (d) are based on Euler's formula: 


eer tiBOx — e%* . eiPi® — е9) (cos В; + isin fj) 


= е" cos Вух + ie sin бух = yj(x) + урт (х) 


with y; (x) and уурт (х) being real solutions. 


Table 12.1 Specific solutions in dependence on the roots ofthe characteristic equation 


Roots of the characteristic equation Solution 
(a) A; is a real root of multiplicity one YO) = m 
(b) Aj = Xl =... = APek-1 у) = е" 
is a real root of multiplicity k > 1 +10) =x- e* 
унъ-10) m xk}. е 
(с) Aj-oj4 ji yj() = е? - cos jx 
Aj+1 = aj — Ві Yj+1@) Le. sin ух 
is a pair of conjugate complex roots 
of multiplicity one 
(d) Aj = Aj+2 Sve j42k—2 = Oj + Ві Vj) ES ебх “СОЗ Bjx 
Apt = Арз = = Аро = Oj — Ві Vj x) = е0) - sin бух 
is a pair of conjugate complex roots +20) =x; ебх . cos Bjx 
of multiplicity k > 1 Yj+3 Œ) = х: e'7* - sin Bix 


Xjg2k-200 = 3*7. 
Yjà2k-1 60 = xk}. e 
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Table 12.2 Settings for special forcing terms 


Forcing term q(x) Setting у(х) 


а e (a) A- e 
if b is not a root of the characteristic 
equation (12.9) 
(b) A-xk. eb 
if b is a root of multiplicity k of the 
characteristic equation (12.9) 
bnx” + Бархи +---+bix+bo (a) Арх" БА qx" +: Ах + А 
if ag 5 0 in equation (12.9) 
(Б) x5. (ах + Appx"! + + Ах + Ao) 


if ag = 4] =" = Ap = Oin (12.9), 
1е.у,у,... ,-D do not occur in (12.8) 
a-cosax + b-sinax (a) Acosax+ Bsinox 
(a or b can be equal to zero) if wi is not a root of the characteristic 


equation (12.9) 

(b) xh. (A - соѕох + В · sin œx) 
if wi is a root of multiplicity k of the 
characteristic equation (12.9) 


Non-homogeneous differential equations 


Now we discuss how a particular solution y" (x) of a non-homogeneous linear differential 
equation with constant coefficients can be found. We describe the method of undetermined 
coefficients and consider here only three specific types of forcing terms q. In these cases, a 
solution can be found by a special setting of the solution function y" (x) in dependence on 
function q in equation (12.7). 


It can be seen that the setting in Table 12.2 follows the structure of the forcing term 
given on the left-hand side. We emphasize that the small letters in Table 12.2 are fixed in 
dependence on the given function g = q(x) whereas capital letters are parameters which 
have to be determined such that y" (x) is a solution of the given differential equation. 
Using the corresponding setting above, we now insert y" (x) into the non-homogeneous 
differential equation and determine the coefficients A, В and 4;, respectively, by a com- 
parison of the coefficients of the left-hand and right-hand sides. It should be noted that the 
modifications described in cases (b) of Table 12.2 are necessary to guarantee that the sug- 
gested approach works (otherwise we would not be able to determine all coefficients by this 
approach). Moreover, we mention that this approach can also be used for sums or products 
of the above forcing terms, where the setting has to follow the structure of the given forcing 
term. We illustrate the determination of the general solution by the following examples. 


Example 12.6 We solve the differential equation 
y' + y = 2y = 2x? — Ax +1. (12.10) 
First, we consider the homogeneous differential equation 


y" +у – 2у = 0. 
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By setting у(х) = е^ and substituting у(х), у (x) and y” (x) into the homogeneous equation, 
we get the characteristic equation 


м +А-2= 0 


which has the two solutions 


À vee І 2=1 а À | 1 02 2 
= === == == ап E = == Z. 
17 72'Y4 07 V4 


Since we have two real roots of multiplicity one, case (a) in Table 12.1 applies for each root 
and we get the complementary function 


уН (x) = Сүе* + Coe. 
Since function q is a polynomial of degree two, we use the setting 
y" (x) = Ax? + Bx + С 
in order to find a particular solution ofthe non-homogeneous differential equation. We obtain 


(Py (x) = 24x +B 
(y P" (x) = 24. 


Substituting the above terms into the differential equation (12.10), we get 
2A + (24x + B) — 2(Ax? + Bx + C) = 2x? — Ax +1. 


Comparing now the coefficients of all powers of x, i.e. the coefficients of x, x! and x°, we 
obtain: 


х2: —24 = 2 

xl: 24 — B = -4 

x: 24 + 2B — 2С = 1. 
From the first equation we obtain A = —1, and then from the second equation B = 1 and, 
finally, from the third equation C — —1/2. The general solution of the non-homogeneous 


differential equation (12.10) is given by 


1 
у(х) = у" (x) +y (x) = Cie + Coe — x? + 2х — y 


Example 12.7 We solve the non-homogeneous differential equation 


Im 


y” + 2y" + 5у = cos 2x. 
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Ах 


Considering the homogeneous equation and again setting у = е^“, we obtain the characteristic 


equation 
А + 257? + 5A = А. (А2 +2045) = 0. 
From the latter equation we get 
м =0, AQ=—-14+V1-5=-14+2i and 43 =-1-V1—-5=-1-2i, 


ie. there is a real root of multiplicity one and two (conjugate) complex roots of 
multiplicity one. Therefore, we get the complementary function 


уй (х) = Спе? + Сое cos 2x + Сзе * sin 2x 


= Су + e (Сә cos 2x + С» sin 2x). 


To find a particular solution y" of the non-homogeneous differential equation, we use the 
setting 


y" (x) = A cos 2х + B sin 2x 
and obtain 
(y) (x) = —2A sin 2x + 2B cos 2x 


ON)" (x) = —44 cos 2x — 4B sin 2x 
(y P)" (x) = 8A sin 2х — 8B cos 2x. 


Substituting y" (х) and its derivatives into the non-homogeneous differential equation, 
we obtain 


(8A sin 2x — 8B cos 2х) + 2(—4A cos 2x — AB sin 2x) 
+ 5(—2A sin 2x + 2B cos 2x) = cos 2x. 


Comparing now the coefficients of sin 2x and cos 2x on both sides of the above equation, 
we obtain: 


sin 2x : 84 — 8B — 104 = —24 — 8B = 0 
cos 2x : —8B — 84+ 108 = —84 + 2В = 1. 
This is a system of two linear equations with the two variables A and B which can be easily 
solved: 
2 1 
В = —. 
17 34 
Therefore, we obtain the general solution 


2 1 
y(x) = уН (x) yl (x) = Ci +e “(С cos 2x + Сз sin 2x) — 9 cos 2x + 34 sin 2x. 


Differential and difference equations 459 


Example 12.8 Let the non-homogeneous differential equation 


y" _ 4y' a et 


be given. In order to solve the homogeneous equation, we consider the characteristic equation 
X — 4). =107 — 4) =0 
which has the three roots 
А = 0, № = 2 and Аз = —2. 
This yields the complementary function 
y (х) = Сүе®* + Ge ce 
= Cy + Coe? + Ce 7, 


To get a particular solution of the non-homogeneous equation, we have to use case (b) 
with k = 1 for an exponential forcing term е2 in Table 12.2 since Аз = 2 is a root of 
multiplicity one of the characteristic equation, and we set 


y" (x) = Ахе^. 


(Indeed, if we used case (a) and set y" (х) = Ae**, we would be unable to determine 
coefficient A, as the reader can check.) We obtain 
ONY (x) = Ae” + 24xe? = Ae**(1 + 2x) 
ONY! Œ) = 2Ae** (1 + 2x) + 24е2* = 24e? (2 + 2x) = 44e” (1 + x) 
ONV (x) = 84e? (1 + x) + 44е2* = 4Ae** (3 + 2x). 
Substituting the terms for (y) ay 


and (y into the non-homogeneous equation, we get 


44e? (3 + 2x) — 44e?*(1 + 2x) = e”. 
Comparing the coefficients of e** on both sides, we get 
124 – 4А = 1 
which gives A = 1/8 and thus the particular solution 


1 
y (х) = pU à 


Therefore, the general solution of the given differential equation is given by 


1 
уб) = y (x) + yl) = С + Cze” + Сзе ™ + que 
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Example 12.9 Consider the homogeneous differential equation 
y" — 4у + 5у – 2у = 0 

with the initial conditions 
y(0) = 1, y(021 and y (0) = 2. 

The characteristic equation is given by 
А — 4? 54-220. 


Since the sum of the above four coefficients is equal to zero we immediately get the root 
A, = 1. Applying now Horner’s scheme for dividing the polynomial of degree three by à — 1, 
we get the equation 

43 — 42 + 5h = 2 = (0—1). 2—3). +2). 
The quadratic equation A? — 3А + 2 = 0 has the two roots 


у = 1 and Аз = 2, 


i.e. there is опе real root Лу = А = 1 of multiplicity two and one real root лз = 2 of 
multiplicity one. Therefore, the following complementary function is obtained: 


y" (x) = Cie + Сохе + Cae. 
Now we determine a particular solution satisfying the given initial conditions. We obtain: 


y#(0) = 1 : Ci + Сз = 1 
(уу (0) 21: €i €) +263 = 1 
(уй) (0) = 2: Cy +20 + 4С; = 2. 


This is a system of three linear equations with three variables which has the unique solution 
Су = 0, C2 = –1 and Сз = 1. 

Hence we get the particular solution 
у(х) = —xe + e” 


of the initial value problem. 
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12.3 SYSTEMS OF LINEAR DIFFERENTIAL EQUATIONS OF 
THE FIRST ORDER 


In this section, we briefly discuss the solution of systems of linear differential equations of 
the first order. We consider the system 


у(х) = ап yi (x) + ai2 у(х) +--+ + ain yaQ) + 910) 
V(X) = a21 yi (xX) + a22 y2(x) + +++ + aan n(x) + qa (x) 


y,G) = ап у (х) + an2 у(х) nA p Ann Yn (x) F 9п(х) 


or, equivalently, in matrix representation 


y, CO ап ар ... dim yi) qi) 
yŒ) a2) 422 ... An y2(x) 42 (x) 
E 5 6 | а (12.11) 
yj, (x) Gn] п ... Ann yn(x) Яп (х) 
A system is called homogeneous if all functions д; (х), i = 1,2,...,n, are identically equal 


to zero. We define a solution and the general solution of a system of linear differential 
equations of the first order in an analogous way as for a differential equation. Now a solution 
is characterized by a vector у(х) = (y1 (х), у2(х),...,уп(х))Т of n functions satisfying the 
given system of n linear differential equations. For an initial value problem, also given are 
the n initial conditions 


y1xo) = y10, y2(xo) = у20, I Yn xo) = упо, 
and a particular solution y" (x) is a vector of n functions in which the n constants from the 


general solution have been determined by means of the initial values. 


The following theorem gives a relationship between one linear differential equation of order 
n and a system of n linear differential equations of first order. 


THEOREM 12.6 Let a differential equation of order n be linear with constant coefficients. 
Then it can be written as a system of n linear differential equations of first order with constant 
coefficients. 


PROOF Ifa linear differential equation of order n is given in the form (12.7), we set 


yiG) = у(х), р(х) = у(х), ...‚ уа) =у“ 0 (x) 
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and obtain a system of differential equations: 


у(х) = y2(x) 
y») = уз(х) 


Yp—1 0) = Yn) 


Vy (x) = –аоу (x) — ayz (x) — +++ — An—1¥n—1(%) + 40). 
a 


Under additional assumptions (which we do not discuss here), there is even an equivalence 
between a linear differential equation of order n with constant coefficients and a system 
of n linear differential equations of the first order with constant coefficients, i.e. a linear 
differential equation of order n can be transformed into a system of n linear differential 
equations of the first order with constant coefficients, and vice versa. By the following 
example, we illustrate this process of transforming a given system into a differential equation 
of order n by eliminating systematically all but one of the functions y;(x). 


Example 12.10 Let the system of first-order differential equations 
у= у +уз +e 
у = у 23 
у= у +y +e 
be given with the initial conditions 
yi(0)=0, — y2(0) = 0 and }3(0) = 1. 


We transform this system containing three functions y; (х), уг (x), уз (x) and its derivatives 
into one linear differential equation of order three. To this end, we differentiate the first 
equation which yields 


yi =y, +y + e 
= (y1 = уз) + Gi +y2 +e") + e* 
= 2у + y2 — уз + 2e”. 


Differentiating the latter equation again, we get 


/// 


у = 29) +y — уз + 2e” 
= 2(у2 + уз + е) + (у — уз) — Ол +2 + е) + 2e* 
= y2 + уз + Зе". 


Now we can replace the sum уз + уз by means of the first equation of the given system, and 
we obtain the third-order differential equation 


yl! y, = 2€. 
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Using the setting 


yna) =e", 
we obtain the characteristic equation 
›3—=0 
with the roots 
у = 0, № = 1 and Аз = —1. 
This yields the following general solution of the homogeneous equation: 
уН = Сү + С.е + Су-е *. 
In order to find a particular solution of the non-homogeneous equation, we set 
yl! = Axe* 


(notice that number 1 is a root of the characteristic equation, therefore we use case (b) in the 
setting for an exponential forcing term according to Table 12.2), which yields 


OW) = Дек + 1), О)” = Ae*(x +2) and (y )" = Дех + 3) 
and therefore 
Ae" (x + 3) — Ae* (x + 1) = 2е*, 
i.e. we obtain 
e (Ax + ЗА — Ax — A) =2e* 
from which we obtain 
A = 1. 
Therefore, the general solution is given by 
y) = С + Ce + Cze * + xe". 


We still have to determine the solutions уә (х) and уз (х). This can be done by using the first 
equation of the given system and the equation obtained for уү. We get 


yty=y-& 
уз — уз =y] — 2у — 2€. 


Adding both equations and dividing the resulting equation by 2 gives 


1 3 
у-у е. 


1 / 
30) = 3 +5 2 
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Inserting the general solution yı and its derivatives 
yi = Соё — Cze + e + xe 
yy = Ое" + Cze * + 2e* + хе", 

we get 
у(х) = С — Cae *. 


Subtracting the above equations for y? + уз and y2 — уз and dividing the resulting equation 
by two yields 


1 1 1 
уз(х) = 3n — y" +у + 5° 
Inserting the above terms for y; and уү we obtain 
y3(x) = Ci + Се" + xe. 


Next, we consider the initial conditions and obtain 


у100) =0: С, + C; + Сб; = 0 
y2(0) = 0: —С, - С; = 0 
y3(0) = 1: с + с = 1. 


This system of three linear equations with three variables has the unique solution 
С = 1, С = 0 and Сз = –1. 
The particular solution of the initial value problem is 
yl (x) =1l-—e*+xe* 
уо) =-1+e™* 
X (x) = 1 + xe". 


It is worth noting that not every initial value problem necessarily has a solution. Next, we 
discuss an approach for solving a system of linear differential equations of the first order 
directly, without using the comment after Theorem 12.6. 


Homogeneous systems 


We first discuss the solution of the homogeneous system, i.e. we assume g;(x) = 0 for 
i = 1,2,...,n. We set 


yix) Z1 
у(х) Z2 


y(x) = рее (12.12) 


yaQ) Zn 


Using (12.12) and 


y, €) E 

/ 
V(x) Z2 

f . = . e, 
y, (x) Zn 


we obtain from system (12.11) the eigen 


ар = А a12 ain 
azı an — А an 
ап] an2 апп — А 


We determine the eigenvalues A1, A2,... 
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value problem: 


21 
22 


‚ An and the corresponding eigenvectors 


1 2 n 

21 21 21 

1 2 п 

2, 2, 2 

z! = 2 В 22 = в К ш; 71 : 
21 22 z^ 


n 


As in the case of one differential equation, we have to distinguish several cases. 
In the case of real eigenvalues А; of multiplicity one, the general solution y (x) is as follows: 


уй (х) 21 z? z 
1 2 n 
x Z 2 2 
уН (х) = Ya ( ) = Ci 5 ећ1х ES C : e? x eu Gi 2 e, 
yË (x) zl z2 21 


The other cases can be handled analogously to the case of a differential equation of order n. 
We give only a few comments for the cases of a real eigenvalue of multiplicity k > 1 and of 
complex eigenvalues a; + if; of multiplicity one. 


If A; is a real eigenvalue of multiplicity k, the eigenvectors z',z’t!,...,z't*—! associated 


with this eigenvalue are not necessarily linearly independent. Therefore, we have to modify 
the term for finding the corresponding part of the general solution, and we use the following 
setting for an eigenvalue A; of multiplicity k: 


1 2 k 
21 Zi 21 
1 2 k 
2. | 2 2 2 Ve ds 
C; я enix ER С 2 xe^* Жз ЕЕ Cr : x* lg. 
zi z zk 


n n 


In the case of complex eigenvalues о; + if; of multiplicity one, we take one of them, e.g. 
a; +if;, and determine the corresponding eigenvector Z (which contains in general complex 
numbers as components in this case). Then we calculate the product 


т) e iE; = # . e" . (cos В; + isin Bj) = z*. 
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The real part z/ and the imaginary part z/* ! of the resulting vector z*, respectively, yield two 
linearly independent solutions, and we use the setting 


BH m 
C; z + Car zt 


for the corresponding part of the general solution of the homogeneous differential equation. 


Non-homogeneous systems 


A particular solution of the non-homogeneous system can be determined in a similar way 
as for a differential equation of order n. However, all occurring specific forcing terms 
qı (x), q2(x), ..., qu(x) have to be considered in any function y (x). 


We finish this section with two systems of linear differential equations of the first order. 


Example 12.11 We consider the system 


у = M 
Уу = y» + уз + 253 
X = -2y 4у2 3y3 


with the initial conditions 
yi (0) = 13, y2(0) = 0 and y3(0) = —4. 


We use the setting 


yi) E 
у(х) |= | 2 |е“. 
yx) Z3 


Inserting this and the derivatives у, i є {1,2,3}, into the system, we get the eigenvalue 
problem 


2-0 0 zi 
1 od 2 z |=0, 
2 4 ex 23 


i.e. we have to look for the eigenvalues of matrix 


2 0 0 
А=|1 1 3 
os end. cud 


Determining the eigenvalues of matrix A, we investigate 


2-0 0 
й=М =! 1 ees. 03 -0-»9. 


NS UN 
2 4 —3-A 


—4- (23625 


(2—.[@—5)(—3 — 5) + 8] = (2—5). 02+ 2л +5) = 0. 
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This yields the eigenvalues 


Ay=2, Ag=—-l4+v1-—-5=-14+2i and 23—-1—-4^41—52-—1- 2i. 


Next, we determine for each eigenvalue a corresponding eigenvector. For A, = 2, we obtain 
the system 


021 + 021 + 021 = 

z! — 21 + 221] = 

1 1 а= 

—2z; — 42, — 5 = 
The coefficient matrix of this system has rank two (row 1 can be dropped and the third 
equation is not a multiple of the second equation), so we can choose one variable arbitrarily. 


Choosing 21 = 6, all components in the resulting solution are integer and the corresponding 
eigenvector is 


—13 
z! = —1 
6 


For 4? = —1 + 2i, we get the following system: 


G-2)2 + 022 + 022 E 
z? + Q-2)02 + 222 = 
-27 - 422 + (2 2022 = 


From the first equation, we get 
z =0. 
This reduces the original system to the following system with two equations and two variables: 


Q-2)2 + 2 = 0 
—425 + (2 2022 0. 


Obviously, both equations (i.e. both row vectors) are linearly dependent (since multiplying 


the first equation by (—1 — i) yields the second equation). Choosing 22 = 2, we obtain 


z = —2 + 2i. Thus, we have obtained the eigenvector 


0 
r = 2 
—2 + 2i 


Finally, for 43 = —1 — 2i, we get the system 
G-c-2)z2 + 023 + 023 = 

zi + Q-c2025 + 223 = 

—2z} — 423 + (2+ 2i)z3 
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As in the above case, we get from the first equation z — 0. Solving the reduced system with 
two equations and two variables, we finally get the eigenvector 


0 
P= 2 
—2 — 2i 


Thus, we have obtained the solutions 


“13 0 0 
—1 |e, 2 еС!+20х _ ара 2 еС1+20х, 
6 —2+2 qu 


In order to get two linearly independent real solutions, we rewrite the second specific solution 
as follows. (Alternatively, we could also use the third solution resulting from the complex 
conjugate.) 


0 0 
2 eC 20x. — 2(cos 2x + i sin 2x) e: 
—2 + 21 (—2 + 21) (cos 2х + i sin 2x) 
0 
= 2(cos 2x + i sin 2x) ех. 


—2 cos 2х — 2 sin 2х + i(2 cos 2х — 2 sin 2x) 


In the first transformation step above, we have applied Euler’s formula for complex numbers. 
As we know from the consideration of differential equations of order n, the real part and the 
imaginary part form specific independent solutions (see also Table 12.1), i.e. we obtain 


0 0 
2 cos 2x e" and 2sin2x e^. 
—2(cos 2x + sin 2x) —2(— cos 2х + sin 2x) 


From the considerations above, we obtain the general solution as follows: 


0) m 0 
уб) }=Cr{ -1 Je*«c; 2 cos 2x g^ 
уз(х) 6 —2(cos 2х + sin 2х) 
0 
+ C3 2 sin 2x e * 


—2(— cos 2х + sin 2x) 
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Using —2C7 = Cj for i € {2,3}, we can simplify the above solution and obtain 


yix) —13 0 
nw |=с,| -1 Je*«c6 — cos 2x e* 
уз (x) 6 cos 2x + sin 2x 
0 
+ Сз — sin 2х e". 


— cos 2x + sin 2x 


It remains to determine a solution satisfying the initial conditions. We obtain 


yi(0) = 13: —13Cj + 0 + 0 = 13 
y20)= 0: -^ - Q + 0-2 0 
y3(0) = —4: 6С, + C5 EE C3 = —4 


which has the solution: 
Cı = –1; Со = 1 апа Сз = -1. 


Thus, the particular solution у? (x) of the given initial value problem for a system of 
differential equations is as follows: 


yP (x) 
уР(х) = | xo 
yh (x) 


13 0 0 
= 1 e? + — cos 2x ех + sin 2x e^. 
—6 cos 2x + sin 2x cos 2x — sin 2x 


Example 12.12 We consider the following system of linear differential equations of the 
first order: 


X7» +е^ 
уу = 41 — 3y2 + 3x. 


In order to determine a solution of the homogeneous system 
ГА 
у= у 
y = 4y1 — 332, 


we use the setting 


ee ) = ( д] je 
y2(x) Z2 ` 
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and determine first the eigenvalues of matrix 


e 5) 


formed by the coefficients of the functions у; (х) in the given system of differential equations. 
We obtain 


a-an=|'7? 0 | 


4 —3—У 
= (1—A)(-3 — A) 20 


which has the solutions 
Ay =1 and № = —3. 


We determine the eigenvectors associated with both eigenvalues. For 4; = —1, we obtain 
the system 


02! + 021 = 0 
421 — 421 = 0, 


which yields the eigenvector 


1 
1 21 ) ( 1 ) 
= = ; 
1 
| 23 | 
For A2 = —3, we obtain the system 
422 + 022 = 0 
422 + 022 = 0, 


which yields the eigenvector 
2 
2 0 
22 
Thus, we obtain the following general solution y" (x) of ће homogeneous system: 


у!) LOS 2 QA. as 
o -a(1)e +a(q Je 


Next, we determine a particular solution y" (x) of ће non-homogeneous system. Since we 
have the two forcing terms 3x and e?*, we use the setting 


Ng) = у Q) _ ( 4x + Ао + Ae 
У YN (x) Bix + Bo + Be% J: 
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Notice that, according to the earlier comment, we have to use both the linear term and 
the exponential term in each of the functions y (x) and yy (x). Differentiating the above 
functions, we get 


QNE) ане 
ойу) ХВ + 2Be* JF 


Inserting now y" and ( yy y,i є {1,2}, into the initial non-homogeneous system, we obtain 
Ay + 24e? = Aix + Ao + Ае^ +e” 
B, + 2Be* = 441x + 440 + 44е2* — ЗВух — ЗВо — 3Be™ + Зх, 
which can be rewritten as 
A1 + 24e? = Ао + Aix + (4+ ie 
Ву + 2Be* = (449 — 3Bo) + (441 — 3B1 + 3)x + (44 — 3B)e”™. 


1 


Now we compare in each of the two equations the coefficients of the terms x!,x° and e?” 


which must coincide on the left-hand and right-hand sides: 


xb :0=A; 
0 = 441 — 3B) +3, 


which yields the solution 


Aj =0 and Bi = 1. 


Furthermore, 
x? н A1 = Ag 
Bj = 4Ао — 3Bo. 


Using the solution obtained for B, and А], we find 
Ay = 0 and Bo = —-. 


Finally, 


e* :24=A+1 
2B = 4A — 3B, 


which yields the solution 


4 
A=1 and B 
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Combining the results above, we get the particular solution y" (x) of the non-homogeneous 
system: 


Finally, the general solution y(x) of the non-homogeneous system of differential equations 
is given by the sum of y (x) and y" (x): 


yo) = ( era ) = уо) жуто) 


za CVA S Vere D 4 
= (1 1 е 2 1 e PIS NS 


12.4 LINEAR DIFFERENCE EQUATIONS 


12.41 Definitions and properties of solutions 


While differential equations consider a relationship between a function of a real variable and 
its derivatives, in several economic applications differences in function values have to be 
considered. A typical situation arises when the independent variable is a discrete variable 
which may take e.g. only integer values. (This applies when a function value is observed 
every day, every month or every year or any other discrete periods.) For instance, if time 
is considered as the independent variable, and we are interested in the development of a 
certain function value over time, then it is often desired to give a relationship between 
several successive values y;,y:+1,---,¢+ Which are described by a so-called difference 
equation. For the solution of special types of such equations we use a similar concept as for 
the solution of differential equations. We begin with the formal definition of a difference 
equation of order л. 


Definition 12.6 If for arbitrary successive n+ 1 terms of a sequence {y;} the equation 


Yt+n = da-A( yren-i +: + aa (t) yea + а0(@) yc + q(t) (12.13) 


holds, it is called a linear difference equation. If a(t) 4 0, the number n denotes the 
order of the difference equation. 


Definition 12.7 If we have q(t) = 0 in equation (12.13), the difference equation is 
called homogeneous, otherwise it is called non-homogeneous. 
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Definition 12.8 Апу sequence {y+} satisfying the difference equation (12.13) is called 
a solution. The set S of all sequences {y;} satisfying the difference equation (12.13) is 
called the set of solutions or the general solution. 


First, we present some properties of solutions of linear homogeneous differential equations 
of order n. 


Homogeneous difference equations 
For a homogeneous difference equation of order n, we can state analogous results as for a 


homogeneous differential equation of order n given in Chapter 12.2.1. 


THEOREM 12.7 Let 01502... dy?) with t = 0,1,... be solutions of the linear 
homogeneous difference equation 


tin = an 1(0 yr En-1 occ ai (t) yr F ao (t) y: (12.14) 
Then the linear combination 
y = Су + Cay? ++ Су" 


is a solution as well with C1, C2,..., Cm € R. 


Definition 12.9 The solutions {y}}, {y?},..., (y), m < n, t = 0,1,.. ., of the linear 
homogeneous difference equation (12.14) are said to be /inearly independent if 


Cy + Оу +--+ Слу" = fere bl. 


is only possible for C? = C2 = - - - = Cm = 0. Otherwise, the solutions are said to be 
linearly dependent. 


The next theorem gives a criterion to decide whether a set of solutions for the homogeneous 
difference equation is linearly independent. 


THEOREM 12.8 The solutions iy}, 02) „Ду m <n, t = 0,1,..., of the linear 
homogeneous difference equation (12.14) are linearly independent if and only if 
yi yoo X 
y y uS y 
Ce" ti I |20 fort=0,1l,.... 
1 2 
Yi+m-1 Yt+m-1 coe Yim- 


C(t) is known as Casorati 5 determinant. Now we can describe the general solution of a 
linear homogeneous difference equation of order n as follows. 
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THEOREM 12.9 Let {y}}, {y7}, ...,{y/} with t = 0,1,..., ben linearly independent solu- 
tions of a linear homogeneous difference equation (12.14) of order n. Then the general 
solution can be written as 


5н = {OEY = Сй} + бу} e 6X Ci Cass 6 eR]. 


Non-homogeneous difference equations 


The following theorem describes the structure of the general solution of a linear non- 
homogeneous difference equation of order n. 


THEOREM 12.10 Let 
5н = {OE} = Су + у] e Chyi, Cis C2- Ca eR] 


be the general solution of a linear homogeneous difference equation (12.14) of order n and 
{у } be a particular solution of ће non-homogeneous equation (12.13). Then the general 
solution of a linear non-homogeneous difference equation (12.13) can be written as 


S= [bn o»! ty", oF еи] 


= {Ou ly = бу} + Cay cox XS Ci, Ca.. Cn ER}. 


Initial value problems 


If in addition n initial values of a linear difference equation of order n are given, we can 
present the following sufficient condition for the existence and uniqueness of a solution of 
the inital value problem. 


THEOREM 12.11 (existence and uniqueness of a solution) Letn successive initial values 
уо, Jl» - - -  Yn—1 for a linear difference equation 


Yt+n = An—1Yt+n-1 + а@1у „1 + аду + q(t) 


with constant coefficients aj, i € {0, 1,...,п — 1}, be given. Then this linear difference 
equation has exactly one particular solution ty? } witht = 0,1,.... 


We note that, if the n initial values are given for not successive values of t, it is possible that 
no solution or no unique solution exists. In the following, we describe the solution process for 
difference equations of the first order with arbitrary coefficients and for difference equations 
of the second order with constant coefficients. 


12.4.2 Linear difference equations of the first order 


Next, we consider a linear difference equation of the first order such as: 


yr = ao(t)y: + 90). 
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This difference equation can be alternatively written by means of the differences of two 
successive terms y;..1 and y; as 


Ay = у ы — yi = a(t)yi + q(t), 


where a(t) = ag(t) — 1. (The latter notation illustrates why such an equation is called a 
difference equation.) 


Constant coefficient and forcing term 


First, we consider the special cases ofa constant coefficient ao (t) = ао and a constant forcing 
term q(t) = q. 


If we consider the homogeneous equation у; = аоу;, we obtain the following: 
yı = 400 


y2 = aoyi = абуо 


t 
Yt = а0у&—1 = а0уо. 


If there is a constant forcing term, i.e. the non-homogeneous difference equation has the form 
Yt+1 = аду + q, we obtain: 


yı = aoyo + q 
уз = аруу + q = ao (aoyo + 4) + q = адуо + aoq + q 
уз = аду + q = ao (ayo + aoq + 4) + 4 = ajyo + 4(а + ao + 1) 


yr = abyo + (а +--+ ao + 1). 


For ао Æ 1, the latter equation can be written as follows (using the formula for the (t + 1)th 
partial sum of a geometric sequence): 


аі – | 


у = буо +9: —0——, t=1,2,... 
ao 


| 
For ао = 1, we obtain 
у = yo t qf. 


Variable coefficient and forcing term 


Now we consider the general case, where the coefficient and the forcing term depend on 
variable ft, i.e. the difference equation has the form 


yr = ao(t)y: + q(t). 
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Then we obtain: 


yı = ao(0)yo + q(0) 

ao(1)y1 + q(1) = ao(1) [ao(0)yo + 4(0)] + 401) 

ao(1)ao(0)yo + ao(1)q(0) + q(1) 

уз = a0(2)y2 + 4(2) = а0(2) [ao(1)ao(0)yo + ao(1)q(0) + 4(1)] + 402) 
= а0(2)а0(1)а0(0)уо + ao(2)ao(1)q(0) + а0(2)4(1) + 402) 


y2 


t—2 1—1 


1—1 
=] [0+ У | [] 20 |a +900), t=1,2,... 
i=0 


j=0 \i=j+1 


T 
| 


Example 12.13 Consider the following instance of the so-called cobweb model: 


Yr = 1+ 2pi 
Xt = 15 — Pt 
Jt = Xt; 


where for a certain product, y; denotes the supply, x; denotes the demand and р; the price at 
time t. Moreover, let the initial price po = 4 at time zero be given. 


The first equation expresses that a producer fixes the supply of a product in period t + 1 in 
dependence on the price p; of this product in the previous period (i.e. the supply increases 
with the price). The second equation means that the demand for a product in period t depends 
on its price in that period (i.e. the demand decreases with increasing price). The third equation 
gives the equilibrium condition: all products are sold, i.e. supply is equal to demand. 


Using the equilibrium y;+1 = x+41, we obtain 


1+ 2р; = 15 — pı 


pi = 14— 2p; 
which is a linear difference equation of the first order (with constant coefficient and forcing 
term). Applying the above formula and taking into account that ао = —2 4 1, we obtain 
t t 
i a = 1 ; (-2) — 1 
= +q: = (-2 + 14. —— —— 
Pt = agpo + 4 2:5] (—2) po ЭЕ] 


ou d zu. dd UM 
= (—7) ро t; [1 ЕРЕКЕ =). 


Using the initial condition po = 4, we get the solution 


_ 14 EA 14\ 14 i 2 
P= z tO (4 7) = 9+2 (3). 
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For the initial value problem, we get the terms 


i -Bea к ML M 
po = &, Р= 5 3 pam s 


14 2 6 14 2 30 


SETE PN ET 
АР зао Н 


which is illustrated in Figure 12.3. 


Pt, 


10 + 


Figure 12.3 The solution of Example 12.13. 
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12.4.3 Linear difference equations of the second order 


Next, we consider a linear difference equation of the second order with constant coefficients: 


у+2 = а + аду + 90), (12.15) 
or equivalently, 

Yt+2 — ау 1 — аду = Q(t). 
First, we consider the homogeneous equation, i.e. g(t) = 0: 

Yt+2 — а1у+1 — аду = 0. 


In order to determine two linearly independent solutions and then the general solution y 
according to Theorem 12.10, we use the setting 
yf = т, 


which is substituted in the homogeneous difference equation. This yields the quadratic 
characteristic equation 


m^ —aym — ag = 0 (12.16) 


which has the two solutions 


m2= = E, — + ap. 


We can distinguish the following three cases. Treatment of these cases is done in the same 
way as in Chapter 12.2.2 for linear differential equations (where the term y;+; in a difference 
equation ‘corresponds’ to the derivative y (x) in the differential equation). 
(1) Inequality 

ES, 

— + ao > 

4 0 


holds, i.e. there are two different real roots of the quadratic equation. Then we set 
у = Сут! + Com). 
(2) Equation 
ai + ао = 0 
4 


holds, i.e. there is a real root mj = m» of multiplicity two. Analogously to the treatment 
of a differential equation, we set 


yr = (Cy + Сот. 
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(3) Inequality 
2 


“з б 
— + ао < 
4 0 


holds, i.e. there are two complex roots 


mi» —azc bi 
of the quadratic characteristic equation with 
ai 


rst. and b= 7 


—a? — 4ao 


Then we get the real solutions 

yr = r! - (Cicosat + C; sinat), 
where 

r= Vm = Va? + BP 


and o is such that 


4a? + aga? 
a a y 80 04 b 
: м 20 = = 2 


and sino — = 
2а0 r 2а0 


cosg = 


Next, we consider the non-homogeneous equation, i.e. we have q(t) Æ 0 in equation (12.15). 
A particular solution y" of a non-homogeneous differential equation of the second order can 
be found for a polynomial or an exponential function as forcing term q(t) by a specific setting 
as shown in Table 12.3. 


Analogously to differential equations, we can also use this method of undetermined coef- 
ficients for a sum or product of the above forcing terms. In such a case, the setting has to 


Table 12.3 Settings for specific forcing terms in the difference equation 


Forcing term q(t) Setting уу 


bnt” + byt | +--+ byt Бр (а) У = Ам" As it? | +--+ Ait 40 
if ag + ay # 1 in the characteristic 
equation (12.16) 
(b) yN = t- (Ant? + Ap_yt? | +... + Ayt + А0) 
її ар + ај = Landa, Æ 2 in (12.16) 
(с) y! = 2 - (Ant” + A4 07 +--+» + At + Ау) 
if ар + ay = landa, = 2 in (12.16) 
ab! (а) y! = abt 
if b is not a root of the characteristic 
equation (12.16) 
(b) yN = A. i. oF 
if b is a root of multiplicity k of the 
characteristic equation (12.16) 
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represent the corresponding sum or product structure of the forcing term. We now apply this 
method as before in the case of differential equations. 


Example 12.14 Asanexample, we consider an instance of the multiplier-accelerator model 
by Samuelson, which describes a relationship between the national income y;, the total 
consumption c; and the total investment i; of a country at some time f. Let the following 
equations 
yr = ct d i +50 
1 
C= 2 t YI-1 


ij; = €; — Ct-1 
together with the initial conditions 
yo = 55 and yı = 48 


be given. The first equation shows that the income at time ¢ is equal to the consumption plus 
the investment plus the constant government expenditure (in our example they are equal to 
50 units). The second equation shows that the consumption at time ¢ is half the income at 
time ¢ — 1. Finally, the third equation shows that the total investment at time t is equal to the 
difference in the total consumption at times ¢ and t — 1. 


Substituting the equations for с, and i; into the first equation, we obtain 


1 
Уу = 5 у + (cr — ei) + 50 


1 1 1 
a Ор DU fein: ca AMT 50 
seit (5 Yt-1 TOE 


1 
= yr-1 — 5  yi-2 + 50. 


Using the terms y;42, yr+1, Yı instead ofy;, Yt—1, yi-2, we obtain the second-order difference 
equation 


1 
Yt+2 = yt — 2 : yr + 50. 


To find the general solution of the homogeneous equation, we use the setting 


у= т! 


which, after substituting into the difference equation, yields the quadratic equation 
1 
2 
m = т+ = = 0. 
2 


We get the two solutions 
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Using a, = 1 and ао = —1/2, we get the following equations for r and o: 


is 
=|= = -v2 
FN D5 


and o is such that 


aj 1 /1 I5 
e A = ; = 2 
cosa D му —aQ x} 3755 


and 


sina = 


yi +a 4:-0)2+ (01 fi L4 
2ao Е 2(-1) = 2-2 


The latter two equations give a = 7/4. Thus, the general solution of the homogeneous 
difference equation is given by 


1 t 
p = (52) . (c cos 21+ C5 sin 71). 


To find a particular solution of the non-homogeneous equation, we use the setting y; = A. 
Using уро = yt+1 = A and substituting the latter into the difference equation, we obtain 


1 
Е во 


This yields 
Е А = 50 
5 4- 
A — 100, 


and thus the particular solution of the non-homogeneous equation is 
yl — 100. 


Therefore, the general solution of the non-homogeneous difference equation is given by 


1 t 
y - yt +у = ба (Ci сов + C2 sin t) + 100. 

Finally, we determine the solution satisfying the initial conditions. We get 
yo = С: 1+ С. 0 + 100 = 55 => C, = —45 


1 1 1 1 
э = C1: 342: 52+ Cy 2 7/2 + 100 


1 1 
=C1- Os +100 
ото 


45 
= —— + С): 


1 
—+100=48 = С›=—59. 
2 2 2 
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Thus, we get the particular solution yr of the initial value problem 


1 : 1 f 
уР =y” + yN = —45 ба cos at 59 (бм) sin 20+ 100. 


Using this solution, we get 


1 2 1 А 
y2 = —45- (372) cos 5 — 59. (372) sin = + 100 = 70.5, 


уз = 96.5, уд = 111.25,... 


The solution is illustrated in Figure 12.4. 


Ua 


40 + 


30 7 


20 + 


10 + 


1 2 3 4 t 


Figure 12.4 The solution of Example 12.14. 


Example 12.15 Let the non-homogeneous differential equation 
и+ = Wii у + 1 
be given. To solve the homogeneous equation, we get the characteristic equation 


m? —2m+1=0 
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(notice that we have ај = 2 and ао = — 1 in the characteristic equation) which has a real 
solution of multiplicity two: 


m=m=1. 
Therefore, the general solution of the homogeneous equation is given by 
у= СНО t. Y = Сү + Cot. 


To get a particular solution of the non-homogeneous equation, we have to use case (c) ofa 
polynomial forcing term (since ао + a4 = —1+2 = 1 anda, = 2) which means that we set 


yl = At’. 


(In fact, the reader may check that both settings y — A and yl — At are not appropriate to 
determine coefficient А). Substituting у into the non-homogeneous equation, we get 


А(@ + 2)? 2240 41) — P 1 
which can be rewritten as 
A(t? + At +4) — 2A? + 2t -- D + А? = 1. 
Comparing now the terms on both sides, we get 
ДА – 2А = 1 


which gives the solution 


and the general solution of the difference equation is given by 


1 
»=ур +y; = Са + Си + 50. 


EXERCISES 


12.1 Consider the differential equation у = y. 


(a) Draw the direction field. 

(b) Solve the differential equation by calculation. Find the particular solution 
satisfying y(0) = 1 resp. y(0) = —1. 

(c) Draw the particular solution from part (b) in the direction field of part (a). 
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12.2 


12.3 


12.4 


12.5 


12.6 


12.7 


12.8 


12.9 


12.10 


Find the solutions of the following differential equations: 


(а) у =e; (b) 1+ еј)уу = e with у(0) = 1. 


Let a curve go through the point P : (1, 1). The slope (of the tangent line) of ће 
function at any point of the curve should be proportional to the squared function 
value at this point. Find all curves which satisfy this condition. 


The elasticity ғу (x) of a function f : Dy — R is given by 


2 1 
Ef (x) = 2x Inx+ 5 г 


Find function f as the general solution of a differential equation and determine а 
particular function f satisfying the equality f (1) = 1. 

Let y be a function of t and y'(t) = ay(T — t)/t, where 0 < t < T anda is a positive 
constant. Find the solution of the differential equation for a — 1/2 and T — 200. 


Check whether 


yi =x, у =xlnx and уз = — 
x 

form a fundamental system of the differential equation xy" + 2x2y" — xy! +y = 0. 
Determine the general solution of the given differential equation. 


Find the solutions of the following differential equations: 


(a) y'—2y-sinx; 

(b) y"— 2y' +y = 0 with y(0) = 1 and y'(0) = 2; 
(с) 28" +у —у=2е; 

(d) у”—2у + 10у = 10x? + 18x + 7.6 

with y(0) = 0 and y/(0) = 1.2. 


Solve the following differential equations: 


(a) y" — 2y" + 5у! = 2cosx; (b) y" + Зу" = 4y = 3e* А 


Find the solutions of the following systems of first-order differential equations. 


(a) y= ау —» 

y= yi +a; 

© у= хл -22 -» 
+у2 +y3 


че 
ws 
I 
| 
* 


3 yı —y3 
with у1(0) = 12, y2(0) = 6 and y3(0) = —6. 


A time-dependent process is described by the linear difference equation 
yr = а@)у + b(t). 


(a) Find the solution of the difference equation for a(t) = 2 and b(t) = b (b is 
constant) such that yo — 1. 


12.11 


12.12 


12.13 


12.14 
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(b) Investigate how the forcing term 5 influences the monotonicity property of the 
solution. 
(c) Graph the solution for b = 1/2 and for b = —2. 


Consider the following cobweb model. In period t, the supply of a product is given 
by y; = —5 + 2p;_1, and the demand is given by x; = 7 — pr, where p; denotes the 
price of this product at time t. Moreover, equality у, = x; holds. 


(a) Formulate the problem as a difference equation. 
(b) Solve this difference equation with po — 3. 
(c) Graph the cobweb. 


Find the solutions of the following difference equations: 


(а) уно = -—2ynui — 4и; 
(b) y2 = 251 + 8y: + 3t + 6 with yo = —2/3 апау! = 5; 
(с) yu2-2-2yu1 — yr 4. 


A company intends to modernize and enlarge its production facilities, which is 
expected to lead to an increase in the production volume. The intended increase 
in the production volume in year t + 1 is 40 per cent of the volume of year t plus 
15 per cent of the volume of year t — 1. In year t, it is assumed that 440 units are 
produced, and in year t — 1, 400 units have been produced. 


(a) Formulate a difference equation for this problem. 
(b) Find the general solution of this difference equation. 
(c) What is the production volume after five years? 


Given is the difference equation y+) = 3y; + 2t + 1. 


(a) Finda solution by using the solution formula. 

(b) Find the solution as the sum of the general solution of the corresponding homo- 
geneous difference equation and a particular solution of the non-homogeneous 
difference equation. Proceed as in the case of a linear difference equation of 
second order with constant coefficients. 

(c) Prove by induction that for t є N, the solutions found in (a) and (b) coincide. 


Selected solutions 


1 INTRODUCTION 


1.1 (a) false; (d) true; (e) false 
12 D:AABAC, E:AABAC, F:AVBVC, 
G:(AABAC)V(AABAC)V(AABAC) H:F, I:D 


14 (а) ASB; (b A—B; (с) B—4; (d ASB (е) ASB 
1.5 (a) true, negation V x? — 5х + 10 < 0 is false; 
x 


(b) false, negation V x? — 2x < 0 is true 
x 


18 (а) ТЕЕТ; b TETT; (с) LLEE (d ЕТТТ 

19 AUB-(,2,3,5,7,8,9,1, |4UB| 28, ANB= {1,3,7,9}, |AnB| 4, 
l4j26, |B| 26, 4\8 = {5,11}, |A\Bl=2 

1.10 subsets: Ø, {1}, {2}, {1,2}; |P(P(4)| = 16 

113 4 


1.14 110 students have a car and a PC, 440 have a car but no РС, 290 have a PC but no car 
and 840 students have a car or a PC 


1.15 АхА= {(1,1), (1,2), (2, 1), (2,2)}, Ax B= {(1,2), (1,3), (2,2), (2,3)}, 
Bx А = {(2, 1), (3,1), (2,2), 3,2)}, Ax C= {(1,0), (2,0)}, 
A x € x B= ((1,0,2), (1, 0,3), (2, 0,2), (2, 0, 3)}, 
AxAx B= {(1, 1,2), (1,2,2), (2,1,2), (2,2, 2), (1, 1,3), (1, 2, 3), (2, 1, 3), (2,2, 3)} 
1.16 Mı x M x M3 = {x,y |A <x<4)A(-2<y<3)A08z<5)} 
1.17 479,001,600 and 21,772,800 
1.18 56 
1.19 125 and 60 
1.20 n?—n 


1.22 (a) 1,128 

30y —7x . ap 
bero) 6xy 4- 12у?’ (0) 2а? 
(а) fre R|x>—-I]}; (0) {хєК|(х<2)у (х > 11/4) 
(с) (xeRj|(x«-1)v(0-xx2) (d íxeR|(x«-2vQ2«x«4) 
1.25 (a) (-2,25 (b) (1,5); (©) (1/2,5/2; (d) (0,00); (е) (-1,-1/3] 
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1.26 (a) —19/7; (Б) 9c/bf 
127 (а) x=15; (6) m=V2, m=-V2, 3=1, x --l 
(с) xı =—2, x2 19.36, (x3 & —4.99) 
1.28 (a) a=2; (b x=1 
1.29 (a) x=6; (b x} =1, хх» =10, хз = 0.001; (c) x=9 
1.30 (b) xı =2i, x= —2i, x3= 3i, x4 = —3i 
1.31 Imaginary axis 6 4 
—% 4 6 
Real axis 
24 4 
| z 2 9, 
1.32 д +2 = –1 + 5i, 21—22 = 3 + 31, Ziz = —6— fli, ——-—-—-i 
Z2 5 5 
1.33 (a) (2) = 0, (2) = –1, z= - i; 
(6) (2) = –1, 3@=0, 2= –1; 
(с) 922-43, Bz) = 3, 2= —/3—3i 
134 (a) 2= —і = 1(соѕ 3л +isin 37) = e; 
(b) z = —1 = 1(созл +isinz) = e"; 
(с) 2 = 3(cos 60? + isin 60°) = Зе7!/3; | 
(d) z= \/5(сов243° + isin 243°) = 4/5 . e*25i 
1.37 (a) z=V3+i, 25 = —1-+ 5/31, 23 = —V3 — і, z,21-4X5i; 
O a=, m= 3(-vV3-), z-303-)» 
1.38 а = —32, ал = 1 
2 SEQUENCES; SERIES; FINANCE 
2.4 (a) ayo, = 815; (b d=2, а =7, а, =2n+5 
2.2 (a) strictly increasing (b) unbounded; (c) аы = an + 1/2 
23 (a) a, =3, n=16; (b а= 18, g=1/3 
2.4 {ay} is strictly decreasing and bounded, lim an = —5; 
n 


{bn} is not monotone but bounded, 


{cn} is decreasing and bounded, 


lim bn = 0; 
n— oo 


lim c, = 0 
n— oo 
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2.5 (a) lim a, — 2/e; 
n—oo 
—oo for a«0 
(b lim Б, = 1/3 for а=0; 
n— oo 
оо for а> 0 
(c) lim сь = 0 (both for c; = 1 and c1 = 4) 
n—oo 
2.6 (a) shirts: ај = 2,000; a2 = 2,500; аз = 3,000; aio = 6,500; 
trousers: ај = 1,000; a2 = 1,200; аз = 1,440; aio = 5, 159.78; 
(b) shirts: 515 = 82,500; trousers: 515 = 72,035.11 
2.7 (a s= —10/7; (b s=1 


2.8 (a) series converges for —1 <x < 1; (b) series converges for |x| > 1 
29 (a sı = –12, s2 = —8, 53 = -%. 54 = – 197, series converges; 


(D = 1, s2 = 3, 53 = 2, s4 = 31, series converges; 
416 


9 
(с) я = І, 5 = HA. 53 = Ha + "2 54 = 53 + 516” series converges; 


(d) sı =0, 5 = 3, 53 = 2, $4 = 2. series does not converge. 
2.10 (a) 17,908.47 EUR; (b) 17,900.51 EUR; (с) 18,073.83 EUR; (c) is the best. 
2.11 49, 696.94 EUR 
2.12 interest rate i = 0.06 


2.13 (8) Aannuatty = 16,288.95 EUR;  Aquarterty = 16,436.19 EUR; 
A monthly = 16,470.09 EUR; 
(b) leff quarterly — 0.050945; leff —monthly — 0.051162 


2.14 5,384.35 EUR 


2.15 (a) bank A: 2,466 EUR; bank B: 2,464.38 EUR 
(b) bank A: 2,478 EUR; bank B: 2,476.21 EUR 


2.16 (a) sinking fund deposit: 478.46 EUR; 
(b) sum of interest and deposit: 550.46 EUR 


2.17 1,809.75 EUR 

2.18 (а) И = 62,277.95 EUR; (b) № = 63,227.62 EUR 
2.19 (a) A = 82,735.69 EUR; (b) Pi29 = 58, 625.33 EUR 
2.20 (a) P = 22,861.15 EUR; total payment 228, 611.50 EUR 


Redemption table for answer 2.20(a) (EUR) 


Period Annuity Amortization Interest Amount of the 
(vear) instalment loan at the end 
1 22,861.15 10,111.15 12,750.00 139,888.85 
2 22,861.15 10,970.59 11,890.55 128,918.25 
3 22,861.15 11,903.10 10,958.05 117,015.15 
4 22,861.15 12,914.86 9,946.29 104,100.29 
5 22,861.15 14,012.62 8,848.52 90,087.66 
6 22,861.15 15,203.70 7,657.45 74,883.96 
7 22,861.15 16,496.01 6,365.13 58,387.95 
8 22,861.15 17,898.17 4,962.97 . 40,489.77 
9 22,861.15 19,419.52 3,441.63 21,070.25 
10 22,861.15 21,070.18 1,790.97 0.07 


2.21 


Selected solutions 
(b) total payment: 220,125.00 EUR 
Redemption table for answer 2.20(b) (EUR) 


Period Annuity Amortization Interest Amount of the 


(year) instalment loan at the end 
1 27,750 15,000 12,750 135,000 
2 26,475 15,000 11,475 120,000 
3 25,200 15,000 10,200 105,000 
4 23,925 15,000 8,925 90,000 
5 22,650 15,000 7,650 75,000 
6 21,375 15,000 6,375 60,000 
7 20,100 15,000 5,100 45,000 
8 18,825 15,000 3,825 30,000 
9 17,550 15,000 2,550 15,000 

10 


16,275 15,000 1,275 0 


(с) 15.1 years 
(a) the project should not go ahead; (b) 1 per cent, 2 percent; (c) yes 


2.22 (a) depreciation amount in year 1: 7,000 EUR, in year 8: 7,000 EUR 


(b) depreciation amount in year 1: 12,444.44 EUR, in year 8: 1,555.55 EUR 
(c) depreciation amount in year 1: 9,753.24 EUR, in year 8: 4,755.21 EUR 


3 RELATIONS; MAPPINGS; FUNCTIONS OF A REAL VARIABLE 


3.1 


3.2 


3.3 


@ R 
51 . 1 1 
4+ e e ° 2 2 
31 3 3 
2}. 4 4 
1 e e 5 5 


123 4 5 
(b) ТЕБЕ, 


(с) R! = (02,1), (4,1), (1,3), (4,3), (5,3), (1, 5), (4,5)}; 
(d) А: по mapping; S: mapping 


(a) bijective mapping, (b) surjective mapping, 
(c) mapping, (d) injective mapping 
(a) Illustration by graphs: 


f: g: fog: 


1 1 2 ———- 1 2 ———— 1 1 

2 >< 2 3 2 3 -2 -2 

3 3 3 3 Ss 3 
(D Dp =A, Rp = В; Dg = С, Rg = (1,2) CA; "”у = Rfog = С С В; 
(c) f: bijective, 09: injective, fog: injective, gof:no mapping 


489 


490 Selected solutions 


3.4 


3.5 


3.6 


3.7 


3.8 


F-lisamapping with domain Ор: = R and range Rp-ı = [—2, оо). 


Е is a function, F^! exists. а 
G is not a function, у = +V/9 — 4.5x? is an ellipse with midpoint (0,0) and a = 4/2, 
b=3. 


(gof)(x) = 4x? + 4x - 1 (fog) (x) = 2x? – 3 
37 


(a) f and g bijective; (b) f-'@) = lnx, g (x) = =x; 
(с) fog=e™, gof = –е* 
а= 1, Ву= {хєЕ|-4<у<оо}, 10) = 1+ 04+ 
(а) ру = {хєК|х> 0), Е = (уєК| - 1<у=< 1; 
f: y=16: 6+0, =1 5х < l; 
(b ру = Е, Еу = К, fh у= 5+2 
(а) (Ps/P2)(x) = 2x8 — 2x? — 12x; (b) Р(х) = (х – D(x— 1)(х + 2) (х — 3)2х 
(9) 


504 


X1,X2,X4 аге zeroes; хз is not a zero; 


Selected solutions 
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го) = @—-DE+ DEF D@+2[x- $a + УЗ] [x- $a - 039] 


3.11 

3.12 (a) fi: Dg = 
Л : Dg, = 
fs : Dg = 


R, Ry = [-1, 1], odd; 
R, Ар 


[-1, 1], odd; 


R, Ry = [-1, 1]; 


h: 
fa: 


Dp 


R, Ар = [-2, 2], odd; 
R, Ry 


[1, 3]; 


207 exp(2x) 
84 
15 
64 
104 
44 
А B 
EN exp(z-42) 
T T T T F T A i: T 1 
22 2 4 6 -4 -2 2 4 6 


3.13 (a) 


x 


Dy = {x € R|x £0), Rf =R, 


f unbounded and even, 


f strictly decreasing for x < 0, 


f strictly increasing for x > 0 


x 


(D Dr = {xe R|x > 0}, Ry = К, 
f unbounded, 


f strictly increasing 
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(c) Dp =R, Re ={y € Rly = 5), 
f bounded from below and even, 
f strictly decreasing for x < 0, 
f strictly increasing for x > 0 


sp. 9 1 2 3 
X 


(e) Dy-R,R;—iyeRl|y 1}, 
f bounded from below, 
f strictly decreasing 


R| |x| x 2}, 

= {(уєК|0<у< 2), 
f bounded and even, 

f strictly increasing for x « 0, 
f strictly decreasing for x > 0 


34 


Dy =R, Ry = {y e Rl y = 0}, 
f bounded from below, 
f decreasing 


Заоа | Ys d 3 
4 DIFFERENTIATION 
41 (à lim f@) =a; (b) lim f (x) zl 
х х0 x 
(c) lim f(x) = 0; (d lim f(x)-2, lim f(x)=1 
x0 x 1+0 x1-0 
3 2 
42 (a) lim* Зх 2x _ 2, gap; 
x2 =2 
3 3 2 
: x —3x^ _ : ор oe . 
GE niu mop o dB eO c eges 
3 2 
(с) іт x — —oo, pole 
x2 (x — 2) 
4.3 (a) f(2) notdefined, хо = 2 вар; (b) continuous (с) jump; (d) jump 
4.4 (a) not differentiable at xo = 5, differentiable at x; = 0; 
(b) differentiable at хо = 0, not differentiable at x; = 2 
45 (a y = 6х2 — 5 – Зсох; (Б) у = (AX) +4) sinx + (x^ + 4x) cosx; 
@ у= 4x + 2x sinx + 2 sin х + sin?x — x? cos x + соѕ2х. 
(2 + sinx)? j 
(d y = 402х3 — 3x + Inx)? (62 — 3 + 4); 


4.6 


4.7 


4.8 


4.9 
4.10 
4.11 


4.12 


4.13 


4.14 


4.15 
4.16 
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(е) y =—403 + 3х2 — 8)3(3x? + бх) sin(x? + 3x? — 8)5; 

(f) у = —4(3x? + бх) cos? (x? + 3х2 — 8) sino? + 3x? — 8); 

@ у= Сове, h) у= = ; 

(а) y = sinx + cosx; (b y= cx (с) ж у= 2 (4-2/3 HE 
(а) œ= [In tan x) + "ru (tan x)*; 


() И) = (Inx+ 1-4) et cose); 


uA ®+2ух—1 1 1 3 2 
О Озе тор (= TOU. x =) 
4 
(a) f"(x)- 6cosx — 6xsinx —x?cosx; (Б) f(x) = St 
x 
x? + 12х + 12 
€) f"@)=-12 —— T OOs e+e 
exact change: 28; approximation: 22 


—10 
zu. Ud. zu des cd 
of (x) = 2v Qg (x) = 2x? ef (x) = 7% Eg (x) = 2v 


1 1 1 
1 = 1 = —; 1 = —; ] жы етед, 
of (1) = gor (100) = 5; Qg(1) = 5; @g(100) 200 


Function f is inelastic for x € (0,2) and elastic forx > 2; function g is inelastic for 
х > 2. 


When f changes from хо = 1 to 1.01, the function value changes by 0.5 per cent; 
when f changes from 100 to 101, the function value changes by 50 per cent. 
When g changes from some xo by 1 per cent, the function value always changes by 
0.5 per cent. 
ep(p) = —4(p — Up; 
demand D is elastic for p > (1 + 4/2)/2 and inelastic for 0 <р < (1+ 2)/2; 
р 1/2 (ep(1/2) = 1) 
(a) local minima at P, : (0, —5) and P2 : (2, —9), 
local maximum at Рз : (0.25; —4.98), global maximum at Рд : (—5; 1,020); 
(b) global maximum at P, : (3,4), global minimum at P2 : (—5, —4); 
(c) local and global maximum at P, : (0, 1); 
(d) local minimum at P, : (4,8), local maximum at P2 : (0; 0), 
global maximum does not exist; 
(e) global minimum at Po : (0,0), 
global maximum at the endpoint of J, i.e. P4 : (5,0.69) 
local minimum at P; : (1,5/6), local maximum at P» : (2, 2(2 — 1n 2)/3 
fora = —2/3, b = —1/6 
@ 1 (b 1/45 © bL (d 1; (e 1/6 (f —1/6 
(a Dy = R\{2}, по zero, discontinuity at хо = 2, local minimum at Pj : 
(—1/2, 1/5), inflection point at P? : (—7/4, 13/45), lim f(x) = 1, f strictly 
A> lOO 
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decreasing on (—oo, —1/2] U (2, оо), f strictly convex for x > —7/4 


84 


24 6 8 10 


(b Dy = R\{0,1/2}, zero: xo = 4/3, discontinuities: ху = 0 and x2 = 1/2, 
lim f(x) = —3/2, f strictly decreasing on Dy, f strictly convex for x > 1/2 
х 00 


(с) Dy = R\{0;1}, zero: xo = —1, discontinuities: xy = 0 and x2 = 1, local 
minimum at Рз : (3.56, 8.82), local maximum at P4 : (—0.56,0.06), inflec- 
tion point at Ps : (—0.20, 0.02), lim f(x) = —oo, lim f(x) = oo, f 
strictly decreasing on [—0.56,0) U 1,356], f strictly donvex forx > —0.2 

144 
124 
104 
E 
T 
44 
2j 
5 2 4 6 
9 4 т 


(d) Dy(x) = К, по zeroes, local maximum at P, : (2,1), inflection points at 
Р» : (Le), P3 : (3,е7!), lim f(x) = 0, f strictly increasing on (—oo, 2], 
X—-EOO 


(e) 


(f) 
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f strictly concave for 1 < x < 3 
24 


Dy = {x € R |x > 2}, по zero, local maximum at P; : (4, —3 ln 2), inflection 
point at P5 : (6.83, —2.27), lim f(x) = —oo, Hm FAO — —oo, f strictly 
хоо х—2+ 


decreasing for x > 4, f strictly convex for x > 6.83 


Dy(x) = R, zeroes: хо = 0, ху = 2, local minimum at P2 : (0,0), local 
maximum at Рз : (1.33, 1.06), inflection point at P4 : (2,0), lim f(x) = œ, 
х - CO 


lim. f (x) = —oo, f strictly decreasing on (—оо, 0] U (4/3, 00), f strictly convex 
Xx 


forx > 2 
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417 (a) f(x) =1 


2 4 
л 2 л 4 
ту 6 — 2)2 + ggg (x — 2)* + Rs, 


6 = 
Rs = -7 - sin [FC +26 - |) 825 2. 0<A<1; 
4 
Б) f@ax- E -X жы (7—1. E RS, 
(—1)"x п+1 
А О <А <l; 


С Qx4 D 1)’ 


(с) fe) = 2x — 202 - l8 ext 


=. 


R4, 


e **[—4]1 sin(2Ax) — 38cos(2Ax)], O<A<1 


418 T(-4)=14+(-)4+ 447+ 
4.19 (a) Newton’s method 


4.20 


(b) 


regula falsi 


i Ci) -4cb = 0.81873 


Xn f (Xn) 


7 (хп) 
1 —3 —3 
0 +2 —6 
0.333333 40.037037 — 5.66667 


0.339869  4-4.29153.10-5  —5.65347 
0.339877 +4.29153 -1075  —5.65347 


Newton's method 


5 INTEGRATION 


5.1 


(a) 
(c) 
(e) 
(g) 


esinx + C; 

-3 In|1 — 4x| + C; 

М? +14C; 

Ime + 1) — 2arctan e* + C; 


Xn f Gn) 
0 +2 
1 -3 
0.4 —0.336 
0.342466 | —0.146291 
0.339979 | —0.000577 
0.339881 —0.000023 
0.339877  —9.536 1077 
Xn f (xn) 
5 0.3905620 
4.5117973 0.0051017 
4.5052428 0.0000010 
4.505241496 1.6. 10-10 


(b) Iünxy +С; 

(d) le ЕВ С; 

(f 152 + I(x? — 2) +С; 
1 : 3 

h) arcsin ~=x+C; 

h 3 25 


5.2 


5.3 


5.4 


5.5 
5.6 


5.7 
5.8 


5.9 


Selected solutions 
; 1 : А D 2 
0 Fane eG ae te 
1 1 
(k) |! | tan 5 | + jam 4 +С 


(a) e@?-2x+2)+C; (Ы) le(Ginx-cosx)4 C; 


(c) xtanx+In|cosx|+C; (d) 1 (cosx sin x + x) + Ci 


xi 3 


() жх y +С; (f) xlnG2 +1) — 2x + 2arctanx + C 

(a) 3; (b 4-21n3; (c) 2/3; (d) 10/3; 

© 21020-1; © 1/5 @ л/4 

(a) total cost C = 9,349.99; total sales 5 = 15,237.93; 
total profit P = 5,887.94 

(b) average sales Sa = 3,809.48; average cost Ca = 2,337.49; 

(c) P(t) = 10,000e~*(—#2 — 2t — 2) + 20,000 — 1,000(4t — 2 In (e! + 1) 
4-2, 000 In 2 

(a 4 (b A= 32.75 


(a) graph ог до = 8, to =4: 


ае | 


2 4 6 8 10 12 14 16 18 20 
t 


2 
(b) ar (1 DES) (с) х=Т/2 


(а) 0.784981; (b) 0.783333; (c) 0.785398; exact value: л /4 
(a) 1; (b 1/2; (o) 1; à>0 ; Xd) 2/15 ASO 


0; A=0 0; A=0 
—o0; А< 0 —00; А<0 
(е) 4; (f) does not exist 
1,374.13 
PS=7; CS=4. 
T 
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6 VECTORS 
1 5 
61 (a) a+b—c=] —5 |, а+3р = | -11 |, 
—9 —7 
—3 —7 
b—4a+2c=]| —4 |, a+3(b—2c)=] —23 |; 
14 —43 


(D a>b, c>a, с>р; 
(с) a'-b=0, аТ.с= 0, bl'.c— –18; 
a,b orthogonal; a,c orthogonal; Z(b,c) ^ 126.3? 


0 —36 
(d (a'-b)e=| 0 |], афт. е) = | -18 |; 
0 18 


(e) |b+e| = 29, |bl+ lel = V21+ 44, |bT.c| = 18, |Ы = 214/44 
62 а=В-2, BER 
6.3 (a) 419; 

(b) a>b: М = (а, а) € В |а > bı Aap > bo; 


lal = Ibl: = (баа) € R? | a? +@ > b? + 23) 


6.4 all the vectors are linear combinations of a! and a2; 
(0, 0.5)? is a convex combination 
йт. Wate Тыз ЁЗ 
65 a = за + gat qa. 


44 (1,4) 


6.6 no 


6.7 по; по basis 
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3 1 0 1 
6.9 (a) з }=o{ о |+з[ 1 |+} о |; 
—3 3 0 —1 


(b) bases: a, al, a? апда,а!,а?; 
(c) b=2a!—a?+a 


7 MATRICES AND DETERMINANTS 


71 (a) A= 2 |, no equal matrices; 


3 
4 
1 
(b) A+D=( 


(c) A+3(BT – 2р) = 


1.5 —0.5 —2 1.5 0.5 0 


73 (a) ав= (33 2 


E 


2 0 15 0 -1 -15 
72 A=| 0 1-05]|«| 1 о -05 
23 27 1 


45 51 58 
A a-(5 18 2) 


6 —9 —12 —15 
12 18 24 30 
(c) AB = 10; BA= de e eu 


4 6 8 10 
—8 4 —12 


2 -4 
74 АВТ={ -1 2 is, 
3 —6 
2 -4 
21-1 0 0 
эбе " x —1 E 0 ) = авт 
3 


7.5 (a) ACB = Da, 


29 0 1 0 E 2 
b) (40B-2| -14 -12 7 —24 |. o 1= 1 178 |. 
—10 9 5 18 4 154 
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7.6 


0 0 
А? = 


14 46 
0 35 


4 = 


0 0 
0 0 


B* =I, keN; B%-! =B; 
В: 74,000 units and R2: 73,000 units; 


0 0 
0 0 
0 0 
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275 EUR for Е} and 156 EUR for Р». 
—1 4 2 0 

A12 = 0 я. a= (4 1 

|Aj2}=—1, |A22| = 2; 

cofactor of ај: —18, 

|A| = —33 

—6, (b) 


x2; 


7.8 (a) 


(b) 
(c) 
(d) 
(a) 
(a) 


xj 2; 


7.9 
7.10 
7.11 
7.12 


5, (c) 70, 
(b xı = —2, 


x = 3, 


(d) 


х =0 


8, 


хз = —5 
the zero vector is the kernel of the mapping 


-1 
10 


ua —14 


—1/11 


—4/11 
4/11 


3/11 —3/11 
1/11 10/11 
—1/11 1/11 

= =] 1 
1/2 1 ope 
—4 —3 2 

-3/2 =1 1/2 


(а) 47 


(à рх! 


5 21 
—2 —9 
1 $ =з 
0 1 -2 


0 0 1 


—41 


2 

0 

5 

2 
=2 

1 16 

0 

0 

0 


ооо о 


—23 
—7 


cp pat ys. Sh 
(АВ) = BA = 17 


(a) Х = ВТА-!; 
(d) X = С-!АВ-!; 
(D х UT —-A)x=y; 


(a) 4= (1 – 4)р 


(е) X=[(A+ 
(с) по; 


with A = (aij) = 


ооо == 
ооо о о 
ооо о о 
оомо о 
ON ONE 


cofactor of a21: —3, 


—10 


(b Х= В(4 + 21)-!; 


оо о о 
ооо о 
ооо о 
ооо о 


keN 


31 EUR each unit of $1, 29 EUR each unit of Sy and 20 EUR each unit of $3; 


cofactor of a31: 12; 


(е) 0, ( (D! 


4 
9 


—1 
—1 
4 


—8 


; © = 
—1 


(с) X = А-!СВ-!; 
ADCT]-! 


(d) у є В > х = (1 – 4)-!у є Rt 
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(b) p=(U-A)'q; 
(с) г= Вр = В(1 – А) !q 


1 3 0 0 16 
103 0 0 0100 5 
withB = (bj) =| 5 1000], Ч—-А)!=| 0012 4 |, 
0000 7 0001 2 
0000 1! 
гї =( 660, 1740, 70) 
8 LINEAR EQUATIONS AND INEQUALITIES 
81 (a) x1=1, x2 =2, хз=0; (b) no solution; 
(c) no solution; (d x1 =5, х= 4, әз = 1 
8.2 (а) (1) Хр = X2 = X3 = x4 = 0; 
Gi) x =—-9t, х= 1, x3 =—3t, x4=t, teR; 
WG x =3, 2553). x3=1, Xy——L 
(1) х=—6—9, x2 =8+1lt, x3 =—2-3t, x4=t, ЄК 
83 (a) х= 5+3, m= 3-31, хз = 13 + Br, х= teR 
x = 24 ét, 3 =5— Ht X3 — t, x4 = l4 р, tem 
(D xı = 4—34 – 22, x2——1-2t —0b, x3-—tj, x4 =143h, x5 =h; 
х= 6-F2tj — 7t, x2— tj, B=h, x4 = —2—3t, + 6h, 
х5 = —l — й +260; 
t,o eR 
31. _ 8a — 46 _ 3а + 12 _ 7 
Ré eg. A л улксн да ле 
8.5 the cases ‘no solution’ and ‘unique solution’ do not exist for any A. 
A=2: x= —42- 25, y,z Є В; 
А2: х= 25, yeR; z=0 
8.6 (а) nosolutionfo:a —0, bER ог а + 1/3, Б= 1; 
unique solution for: а Z0, bÆ 1; 
general solution for: a = —1/3, Б = 1 
(D x1} =24+t, x2 =-t, хз = 3, x4=t, tER; 
(c) xj = 6, x2 = 0, x = 1, ха = 4 
8.7 х= 1, yzÉ22, 223 for a £0, |а| = |b|; 
x=1, y=t, z=5-t, teR, fora= 0, b #0; 
x=4-t, Pa 25 Zt; teR, fora #0, a= b; 
х=-2 +1, y=2, z=t, teR, fora #0, а= —b 
8.8 ker(A) = (—3t/2, —t/2, t)', te R 
8.9 systeml: xı =1, x2=2, x3= 0; 
system 2: xı =5, x2=0, x3=3 
2 -1 -1 1 
—4 4 1 
8.10 (а) A! = 1—2 —1 |; (с) С! = ое Monde 
1 1 1 5 —4 —3 2 
2 —3/2 -1 1/2 
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oif -6 78 515 11-1 8). 
8.11 (a) x=$( 1 59 д0 |) 9 X53 XEM 
—10 10 —2 
e) х= ( 15—11 i 

5.79 17.80 14.19 10.18 

28.99 29.67 9.46 30.54 

11.59 23.73 18.52 20.36 

11.59 35.60 14.19 25.46 

8.13 a belongs to the set; b does not belong to it. 


814 (b) 


8.12 (b X= 


100 200 300 400 500 * 600 
x 0 200 0 100 
(c) (5)-«(8)*x«( 0 ) (oo ) + (о). 


8.15 10 


region 


unbounded 


(у ы Jems Joma Jews )on(1) 


A = 1, ài = 0, i= 1,2,3,4, Ui u2 = 0 
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0 0 3/4 5/9 
= М№| 0 + А | 0 + Аз 0 + А | 7/9 


ХІ 
8.16 (а) | x 
X3 


0 0 К 
Zu +| 1], Ул=1, X90 1=1,2,...,6; 


10/7 0 iz 
xi 3 0 0 0 
(b) x2 = M 0 + ho 0 + Аз 3 + А 12/5 
x3 0 0 0 3/5 
0 5 
CENE. MM No 8 
3/5 i-l 


9 LINEAR PROGRAMMING 


9.1 (a) 
(c) infinitely many solutions (d) no optimal solution 
92 z = 10x, + 6x; — max! 
xı < 100 
хә = 30 
—3x, + x) € 0 
xi + 4x) x 200 
Xpx) > 0 
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9.3 


9.4 


9.5 


9.6 
9.7 


9.8 


—_ 


20 40 60 80 100 


optimal solution: x; = 80, x2 = 30; z = 980 


(a) Z= =z = –х + 2х) —x3 max! 
s.t. XQ + x2 + x3 + м Le 
—3xq + x2 — x + x5 = 4 
x > 0, j=1,2,3,4,5 
(D Z= =z = =x} — 2x2 + 3x3 — х x4'— max! 
st I +x — 5X3 + ixl — ix = 
=] + 2з + Ж — X1 + х = 
—2хї ee uo + xg 


IV. Il 


* * жж 
Xis Х2,Х3,Х4,Х4 »X5, X6, X7 


(a) x128/5, x2—23/5; z= 11/5; 
(b) infinitely many optimal solutions: z — —11; 


1 3 
xa $)ea-»(1). 0<A<1 


(a) х= 0, x» =10, x3—5, x4 =15; z= 155; 
(b) а (finite) optimal solution does not exist; 


— 
оо о Б 


xı = 80, х= 30, х3 = 20, х= 0, х5 = 210, х= 0; z= 980; 


(а) a (finite) optimal solution does not exist; 


(D z= 24; 
xi 5.5 10 
X2 =A! 45 |T-uü-2) 0 Я О < ла <1 
X3 9.5 14 
(c) the problem does not have a feasible solution. 
dual problem of problem 9.5 (a): dual problem of problem 9.7 (a): 
w = 400us + 30и + 5u7 — min! w = 10us + 4ug — max! 
s.t. 20u5 + ug > 7 st —3u4 + us 
10u5 + ug > 4 ид + 2us + U6 
1l2u5 + ug + иу > 5 4u4 + + 3ug 
16и5 + ug > 6 uj = 0; j= 4,5,6 


uj > 0; ј = 5,6,7 


IA IA IA 


s.t. 
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dual problem of 9.7(c): 
w = 4и5 + 9и + Зит — min! 
us + 2ug > 2 
us + 3ug + ит > —1 
—us — шщ + 2и7 > 1 
—us —  2ug = 0 


us Є R; ив Є R; из < 0 


9.9 (а) 


(b) 


primal problem: xı = 0, x2 = 9/2, хз = 1/2, xa = 17/2; z = —18; 
dual problem: и = 1, иб = 4, ит=1; w = —18; 

primal problem: xı = 17, x2 = 0, x3 = 9, x4 = 3; z = 63; 

dual problem: us = 25/4, ug = 3/4, ит = 35/4; w = 63 


9.10 optimal solution of problem (Р): x1 = 4, x;—2; z= 14; 
optimal solution of problem (D): из = 2/3, ш = 1/3, u5—-0; у= 14 


9.11 (c) 
(1) 
(2) 


optimal solutions: 
x1—0, х= 100, x3—50, x4—40; z= 190; 
х= 50, х= 0, хз = 100, жщ = 40; 2 = 190 


10 EIGENVALUE PROBLEMS AND QUADRATIC FORMS 


10.1 A: A123, э =—2, ses. ges 2j 


B: Ay=3+i, №=3 – і, Bea iue) sess 


C: м= 1, 22-2, AZ3=-1, 


1 2 
х= 5 х2 = bh —1 Я х= в 1 
1 2 
0 
à= 1, А№= 3 = 2, х= 1 5 х2 = —2 |; 
1 


ti, b,b € R; tj,0,t3 £0 


10.2 (a) 
(b) 


(c) 


10.3 (a) 


Ад = 1.1 greatest eigenvalue with eigenvector xT = (3a/2,a), a # 0; 

based on a production level x = 3a/2 and хә = a (a > 0) in period t, it follows 
a proportionate growth by 10 per cent for period t+ 1: x1 = 1.65a, x2 = 1.la; 
xı = 6,000; x2 = 4,000; subsequent period: xı = 6, 600; x2 = 4, 400; 

two periods later: x = 7,260; x? = 4,840 

A: А = 1, À2 = 3, Аз = —2, Ад = 5, 


x =h 


оо о н 
о о н m. 
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B: ù= i =4, 3= 
0 
х!2 = + to 1 =h 1 j 
1 —1 
1,0,8, € k \ {0} 
10.4 xT Bx = x"B,x with B, = ( i ru | B, is positive definite 


10.5 (a) AtAyg=2+JS2, В:А = (3 + 4/17)/2, 
С.м = —4, № = 2, 23-23, Р: эл = 1, з = 3 + V8; 
(b) AandD are positive definite, В and C are indefinite 
10.6 (a) d1—3, a=1, азє R; 
(b) any vector x! = (t, t, 0)T with t € R, t Æ 0, is an eigenvector; 
(с) аз>1; (d) no 


11 FUNCTIONS OF SEVERAL VARIABLES 


11.1 (a) f(xi,x2) = Jxix2 


Ec 
4 m 
3 24 
2 m 
1 4 
0 . 
1 2 3 4 5 
x 
isoquants surface in R? 


(b) domains and isoquants 


(i) (ii) 


= +6- 
Ру = {(x,y) e R12 +y? < 9} Dy = {(х,у) e Rix £y} 


(iii) 


11.2 (a) 


11.3 


11.4 


11.5 


11.6 
11.7 


11.8 


(c) 


(e) 


(a) 


(a) 


(a) 
(b) 
(c) 


Selected solutions 507 


. De. 
fe = 2x sin’y, (b) f yx И р, 
fy = 2x? siny cos y; fy = 2yx9? Inx; 
= ухо) кушу, (d £-4. 
fy =X х + xy® D; h= 25: 
fe P Qaxm, (f) л = 20, 
x? +4 Tx] 
радуе, pareman 
T +x +x 
f = 4xze ty +2. fa = LL X3 = 
xt +x +x 
1,200, 000 32, 000, 000 
C,—1200— 2752 Gy = 800-2 —; 


x X 
Cx(80) = — 67.5, C,(120) + 36.67, C,(160) = — 450, C,(240) ~ 244.5 


Дах = foxi = 6x2x3, Jas = fax = 9x2x3, Доху = fax = 18x) x2%3, 


fux, = 6x -xj?, Foxy = 6x13, Fax = 18xix2x3 — T 
4у 4x2 2xy + 2 
fa = 2—9, Љу = > Љу = Ља = 2—5; 
(1 — xy? УУ (1- ху)? 7 UT (1 ху) 
4ху -2G? + у>) 
fe = a Љу = fax fy =hx = Oo 
(х2 - yy Xy y = Jy (х2 — у2)2 
grad f (1,0) = (a, b)", grad f (1,2) = (a, Б); 
grad f(1,0) = Q, 1)T, grad f (1,2) = (3,3.6)'; 
gradf (1,0) = (—1/2, 0)T, gradf (1,2) = (-1/2, —1)T 


the direction of movement is —grad f (1, 1) = (—1.416, —0.909)T 


(a) 
(c) 


dz = i cos 5; dx — 5 соз 5; dy; (b) dz — Qx + y?) dx + (ху + cosy) dy; 
y 
dz=(2xdx+2ydye""; — (d = аъ 


surface: S = 282, absolute error: 4.08, relative error: 4.6 per cent 
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11.9 


11.10 


11.11 


11.12 
11.13 


11.14 


11.15 


11.16 
11.17 


11.18 


11.19 


11.20 
11.21 


11.22 
11.23 


d. d. 
(a) a = 2x e” v + хе? 52; 
(D @ 2! = 24092, + хе? = 605, 
(1) 2 = 2x,092 + х2е221 = 8e? (1 tin ) Int; 
(с) (i) >=; z-60, 
Gi) 2= ne^, z-8e (1 + г)! 
ое о 8 О ei 
al 7-3 таш шу 91360) = 5 
(a) grad C(3,2, D) = (8, 6, 10)", 9С (Po) = 13.36; 
percentage rate of cost reduction: 5.52 per cent; 
(b) The first ratio is better or equal 
pf, = 0.002; Pf x. = 0.00053; Ef xı = 0.2; Ef xı = 0.079 
(a) partial elasticities: 
a x1 (3x17 + 2x5?) euo xix) + 3x5?) 
d 2603? + 21х22 + x93)’ n 2(x? + 2xix2? + x?) 
f homogeneous of degree r — 3/2, r1; 
(b) f is not homogeneous 


z bx . /. 3x cos 3x — sin 3x. 1 yX — 1 + 2x2y) 
a = o) 3; (p) у= 2499524 TIO, (e) у 
(a) у P (b) y 2 (c) y Ray) 


x 
Vl=r; forr £0:r=x2+y%, р = arctan Č ( р.и = arccos 229 
x x* +y 


local maximum at (x;,y,) = (1/2, 1/3); no local extremum at (x2, уз) = (1/7, 1/7) 
(a) local minimum at (x1, y1) = (0, 1/2) with zı = —1/4; 
(b) local minimum at (x1, y1) = (1, In 3 with z; = 1 — n4 
stationary point: (x9, y9) — (100, 200); 

local minimum point with C(100, 200) — 824,000 

local minimum point: x! — (1, 0, 0), 

x—(1, 1, D, x? = (1, —1, —1) are not local extreme points 
no local extremum 


stationary point x — (30, 30, 15) is a local maximum point with P(30,30,15) — 
26, 777.5 


(a) у= 10.05x — 28.25; (b y(18) = 152.64, y(36) = 333.5 
Р: maximum; P2: minimum 


67 max 


(1,5) 


11.24 


11.25 
11.26 
11.27 


11.28 


11.29 
11.30 
11.31 
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(a) local maximum at (х,у) = (4,0) with zı = 16; 
(b) local minimum at (x},x},x}) = (—1/12, 37/12, —30) with zı = —73/48 
local maximum point; values of the Lagrangian multipliers: А = —13 апал = —16 
length = breadth = 12.6 cm, height = 18.9 cm 
local and global maximum of distance Dmax = 84/9 
at (x1,91) = C2, 248, (2,92) = C$. - 25) 
local minimum of distance Р min = 3 at point (x3, уз) = (—1,0); 
local and global minimum of distance D» min = 1 at point (x4, ул) = (1, 0) 


stationary point (x1,x2,x3;4) = (25, 7.5, 15; 5); 
local minimum point with C(25, 7.5, 15) = 187.5 


136/3 
32/3 
3/8 


12 DIFFERENTIAL EQUATIONS AND DIFFERENCE EQUATIONS 


12.1 


12.2 
12.3 
12.4 
12.5 
12.6 


(a), (c) 


(b) =e 

(а) у= Шіе+Ср (Ы) Y=1t2amige 

ку (х= 0) -у+1= 0 

general solution у = Cx’, particular solution у? = x* 
y = Ct! e—t/2 

The functions у], y2, уз form a fundamental system; 


y= Cix + CoxInx + C3 4 
(а) y=Ce*—Lcosx—Zsinx; (Б) у? =el +x; 


2 


(с) у= Cie * + Ce"? + e; (d y= —e"cos3x +x? +2.2x+1 
1 2 
(а) у= Cı + e (Сә соз 2х + Сз sin 2x) + Reese + 5 sin x; 


1 
(b) у= Сех + Coe + Cyxe 7 + 35 
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12.9 (a) 


12.10 (a) 
(c) 
12.11 (a) 
(c) 
12.12 (a) 
(c) 
12.13 (a) 
(c) 
12.14 (a) 


yı =Cie*cosx+ Coe" sinx (Б) yl = Зе2х +9 


уз =Cre*sinx+ Cre cosx yy = —2e* -8e* 
yn =e” —l6e*+9 


у = (1+ b)2! — b; (b) strictly increasing for b > —1; 


80 ъ= 1/2 


—60 b = -2 0 
Pry = —2р+12; (b) pi = —(—2)'+4 with po = 3,р = 6,p2 = 0,...; 


supply 


-10- demand 


2 2 1 2 
» =? (Ci сов ул. + Cosin ул); (b) ур = 4! – (-2) 3! 3 


1 
= Ci(—1)! + Cot(—1) + — - 4 
yı 1(-1)' + С Y + тоб 


ун2 = 14у + 0.15у; (D ж = dS + (0.1); 
1,518.76 units 


у = 3yo + УМС! 4031; (b) у= Зу +3 - 1-1 
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Index 


€-neighbourhood of point 387 
n-dimensional space 231 

nth derivative 163 

nth partial sum 71 


absolute value 37 

amortization installment 90, 93 

amortization table 93 

amount of annuity 85 

annuity 85, 90; ordinary 85 

antiderivative 197 

apex 129 

approximation: by rectangles 215; by 
trapeziums 215 

Argand diagram 49 

argument 117 

artificial variable 338 

augmented matrix 291 

auxiliary objective function 350 


back substitution 301 

basic solution 293 

basis of vector space 245 
Bernoulli-l' Hospital's rule 178 
binomial coefficient 28 
bordered Hessian 426 
break-even point 172 


canonical form 293, 339 
Cartesian product 24, 25 
Cauchy-Schwarz inequality 237 
chain rule 160, 398 

characteristic equation 369 
Cobb-Douglas production function 383, 404 
Cobb-Douglas utility function 431 
cobweb model 476 

coefficient 288 

coefficient of the polynomial 126 
cofactor 265 

column vector 230 

complex number 47 

component of vector 230 
composite mapping 114, 272 


composition 114 

composition of relations 108 
conclusion 3, 9 

conjunction 2 

constant-factor rule 199 

constrained optimization problem 424 
consumer surplus 225 

continuous future income flow 222 
contradiction 1, 5 

convex combination 240, 310 

convex polyhedron 334 

coordinate of vector 230 

cosine function 141 

cotangent function 141 

Cramer's rule 269 

criterion: Leibniz 77; quotient 78; root 79 
critical point 170 


debt 90 

definite solution 448 

degeneration case 351 

degree of freedom 292 

demand-price-function 172 

dependent variable 117 

deposits 85 

depreciation: arithmetic-degressive 101; 
degressive 101; digital 102; 
geometric-degressive 102; linear 101; 
table 101 

derivative 156; directional 399; partial 387; 
second 163 

determinant: Casorati’s 473; Wronski’s 452 

determinant of matrix 264 

difference equation: linear 472, of the first 
order 474, of second order 478 

difference of vectors 233 

difference quotient 155 

difference set 17 

differential equation 444; homogeneous 451; 
non-homogeneous 451; ordinary 444; with 
separable variables 447 

differential of function 164 

differential quotient 156 


514 Index 


dimension of a vector space 245 
dimension of matrix 255 

direction field 445 

disjunction 2 

domain of the function 110, 117 

domain of the mapping 110 

double integral 436 

downward parabola 128 

dual problem 358 

duality 357; economic interpretation 361 


effective rate of interest 83 
eigenvalue 368 

eigenvalue equation 369 
eigenvector 368 

elasticity 166, 183 
elementary transformation 292 
empty set 16 

entering variable 342 

equal matrices 256 

equal vectors 232 
equivalence 4 

equivalent transformation 292 
Euclidean distance 236 
Euler’s theorem 404 

extreme point 310 


factor of the polynomial 131 

Falk’s scheme 260 

feasible region 309 

first-derivative test 170 

first-order differential equation 445 

first-order partial derivative 390 

forcing term 451 

function 110; algebraic 136; 
antisymmetric 124; arccosine 143; 
arccotangent 143; arcsine 143; 


arctangent 143; bounded 123; bounded from: 


above 123, below 123; circular 140; 
complementary 453; concave 125, 175; 
constant 126; continuous 151, 387; 
continuously differentiable 156; convex 125, 
175; cubic 126; decreasing 121, 168; 
differentiable 156; elastic 167; even 124; 
exponential 137; homogeneous of 

degree k 403; implicitly defined 405; 
increasing 121, 168; inelastic 167; 

inside 120; left-continuous 154;linear 126; 
logarithmic 138; non-decreasing 121, 168; 
non-increasing 121, 168; odd 124; 

outside 120; periodic 125; propositional 7; 
quadratic 126; rational 134, improper 134, 
proper 134; right-continuous 154; strictly 
concave 125, 175; strictly convex 125, 175; 
strictly decreasing 121, 168; strictly 
increasing 121, 168; symmetric 124; 
trigonometric 140 


function of a real variable 117 

fundamental system of the differential 
equation 453 

fundamental theorem of algebra 130 


Gauss—Jordan elimination 293 

Gaussian elimination 293, 299 

general solution of the differential equation 445, 
448, 453 

general solution of the system of linear 
equations 289 

generalized chain rule 398 

geometric interpretation of an LPP 330 

Gini coefficient 224 

global maximum 169, 410 

global maximum point 169, 410 

global minimum 169, 410 

global minimum point 169, 410 

global sufficient conditions 434 

gradient of function 392 


Hessian matrix 411 

higher-order derivative test 171 
higher-order partial derivative 391 
Horner’s scheme 132 

hypothesis 3 


identity matrix 257 

imaginary part of the complex number 47 

implication 3 

implicit-function theorem 408 

independent variable 117 

indeterminate form 178 

inflection point of function 176 

initial value problem 448 

inner product 235 

input-output model 277 

integral: definite 210; improper 219, 221; 
indefinite 198 

integrand 198 

integration by parts 204 

integration by substitution 200 

interest 80; compound 81; simple 80 

inverse 273 

inverse demand function 172 

inverse element 244 

investment project 97 

isoquant 384, 445 


Jacobian determinant 407 


Kepler’s formula 217 
kernel 271 


Lagrange multiplier method 425 
Lagrange’s theorem 425 
Lagrangian function 425 
Lagrangian multiplier 425 


law: associative 6, 19, 235, 259; 
commutative 6, 19, 235, 259; distributive 6, 
19, 235, 259; of de Morgan 6 

leading principal minor 379 

leaving variable 342 

length of vector 236 

Leontief model 277 

limit of sequence 65 

limit of function 148; left-side 149; 
right-side 149 

linear combination 240 

linear differential equation of ordern 451 

linear objective function 329 

linear programming problem 329 

linear space 244 

linear substitution 200 

linearly dependent vectors 241 

linearly independent vectors 241 

loan: amortized 90 

loan repayments 90 

local maximum 169, 410 

local maximum point 169, 410, 423 

local minimum 169, 410 

local minimum point 169, 410, 423 

local sufficient condition 426 

logarithmic differentiation 162 

Lorenz curve 224 


mapping 110; bijective 112; identical 116, 273; 
injective 112; inverse 114; linear 271; 
surjective 112 

marginal 156 

marginal cost 213, 389 

marginal function 156 

marginal propensity to consume 159 

market price 225 

matrix 255; antisymmetric 256; diagonal 257; 
indefinite 377; inverse 273; invertible 273; 
lower triangular 257; negative definite 377; 
negative semi-definite 377; orthogonal 263; 
positive definite 377; positive 
semi-definite 377; symmetric 257; upper 
triangular 257 

matrix difference 258 

matrix product 260 

matrix representation 288 

matrix representation of an LPP 330 

matrix sum 258 

mean-value theorem 184, 214 

method: of undetermined coefficients 456 

minor 265 

mixed LPP 360 

monopolist 172 

monotonicity of function 168 

mortgage 94 

multiplier-accelerator-model 480 
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necessary first-order conditions 411 

negation 2 

negative integer 32 

neutral element 244, 258, 261 

Newton's method 189 

Newton's method of second order 190 

Newton-Leibniz's formula 210 

non-negativity constraint 309, 329 

norm 236 

number: irrational 32; natural 32; rational 32; 
real 32 


objective row 340 

one-parametric set of solutions 299 
one-to-one mapping 112 
onto-mapping 112 

operation: logical 1 

optimal solution 330 

optimality criterion 341 
optimization by substitution 425 
order of matrix 255 

order of the differential equation 444 
orthogonal vectors 239 


parabola 128 

partial differential 394 

partial elasticity 403 

partial rate of change 402 

particular solution 448 

Pascal's triangle 29 

payment: annual 85; periodic 85 

period of function 125 

periods for interest 85 

permutation 26 

pivot 300, 342 

pivot column 342 

pivot element 300, 342 

pivot row 342 

pivoting procedure 294, 339 

polar form of complex number 49 

pole of second order 222 

polynomial 126 

polynomial function 126 

power function 136 

power set 16 

premises 9 

present value of annuity 86 

price-demand function 195 

primal problem 358 

principal 80 

producer surplus 226 

production function 383 

profit function 416 

proof: by induction 13; direct 10; indirect 10, 
of contradiction 10, of contrapositive 10 

proportional rate of change 166 


516 Index 


proposition: compound 1, 5; existential 8; 
false 1; open 7; true 1; universal 8 
Pythagorean theorem 238 


quadratic form 376; indefinite 377; negative 
definite 377; negative semi-definite 377; 
positive definite 377; positive 
semi-definite 377 


radian 141 

range of the function 110, 117 

range of the mapping 110 

rank of matrix 290 

rate of interest 80 

real part of complex number 47 

rectangle formula 248 

rectangle rule 297 

redemption table 90, 94 

regula falsi 191 

relation: binary 107; inverse 108; 
reflexive 107; symmetric 107 

remainder 130 

remainder theorem 130 

rentability 98 

return to scale 403; decreasing 404; 
increasing 404 

Riemann integral 210 

right-hand side 288 

right-hand side vector 330 

Rolle’s theorem 184 

root function 136 

root of the function 128 

row vector 230 


saddle point 412 

Sarrus’ rule 265 

scalar multiplication 233, 258 

scalar product 235 

sequence 61; arithmetic 62; bounded 65; 
decreasing 64; geometric 63; increasing 64; 
strictly decreasing 64; strictly increasing 64 

series 73; alternating 77; geometric 75; 
harmonic 74 

set 15; cardinality 15; complement 17; 
convex 310; disjoint 17; finite 15; 
infinite 15; intersection 17; union 16 

set of feasible solutions 309 

set of solutions 289 

shadow price 361 

short form of the tableau 341 

simplex algorithm 343 

simplex method 339 

Simpson's formula 218 

sine function 141 

slack variable 315, 337 


smallest subscript rule 344 

solution of systems of linear equations 289 

solution of systems of linear inequalities 309; 
degenerate 314; feasible 309; 
non-degenerate 314 

solution of the differential equation 445 

solutions: linearly dependent 452, 473; linearly 
independent 452, 473 

square matrix: non-singular 269; regular 269; 
singular 269 

standard form of an LPP 336 

stationary point 170, 411 

Steinitz's procedure 248 

straightline 127 

subset 16 

sufficient second-order conditions 411 

sum of the infinite series 73 

sum of vectors 233 

sum-difference rule 199 

surface 384 

surplus variable 337 

system: consistent 291; homogeneous 289, 302; 
inconsistent 291; non-homogeneous 289 

system of linear equations 287 

system of linear inequalities 308 


tangent function 141 
tautology 5 

Taylor polynomial 187 
Taylor's formula 187 
total differential 394 
transition matrix 262 
transpose of matrix 256 
transposed vector 230 
trivial solution 292 
truth table 2 


unit vector 232 
unknown 288 
upward parabola 129 
utility function 431 


variable 288; basic 293; non-basic 293 
vector 230 

vector space 244, 259 

Venn diagram 17 

Vieta's theorem 133 


withdrawals 86 
Young's theorem 392 
zero element 244 


zero matrix 258 
zero of the function 128 
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